Abstract



In this thesis I consider the general problem of how to make the best possible phase measurements 
using feedback. Both the optimum input state and optimum feedback are considered for both single-mode
dyne¹ measurements and two-mode interferometric measurements. I derive the optimum
input states under general dyne measurements when the mean photon number is fixed, both for 
general states and squeezed states. I propose a new feedback scheme that introduces far less phase 
uncertainty than mark II feedback, and is very close to the theoretical limit. I also derive results 
for the phase variance when there is a time delay in the feedback loop, showing that there is a 
lower limit to the introduced phase variance, and this is approached quite accurately under some 
conditions. I derive the optimum input states for interferometry, showing that the phase uncertainty
scales as $N^{-1}$ for all the common measures of uncertainty. This is contrasted with the $|j0\rangle_z$ state,
which does not scale as $N^{-1}$ for all measures of phase uncertainty. I introduce an adaptive feedback
scheme that is very close to optimum, and can give scaling very close to $N^{-1}$ for the uncertainty.
Lastly I consider the case of continuous measurements, for both the dyne and interferometric cases. 



¹ Dyne measurements are those based on continuous measurement of field quadratures, including heterodyne,
homodyne and more general adaptive measurements.
Chapter 1

Introduction 



In this study I consider the general problem of how to most efficiently measure phase. This is 
a very important subject, as many high precision measurements are based upon measurements of 
phase. In particular, the current search for gravitational waves requires extremely accurate phase 
measurements, where new approaches that surpass the standard quantum limit may be necessary 
to obtain useful results.

Unlike most other quantities that we would wish to measure, it is not possible to measure phase 
directly. We must measure phase indirectly, and this almost always introduces an extra uncertainty 
beyond the intrinsic uncertainty in the phase of the mode. In general, it is possible to improve 
measurements by introducing an auxiliary phase shift. In the case of homodyne measurements, this 
phase shift would be based upon the previous knowledge about the phase. In this study I consider 
the case that the phase is unknown, and instead the auxiliary phase is adjusted based on the data 
obtained during the measurement. I investigate schemes for adjusting the auxiliary phase so as to 
introduce the minimum possible phase uncertainty. 

Another aspect of the problem of efficiently measuring the phase is the state itself. Every state 
with finite energy will have some intrinsic uncertainty in the phase. We would wish to optimise the 
state so that it has the minimum possible intrinsic phase uncertainty, or alternatively so that it gives 
the minimum phase uncertainty for some specific phase measurement scheme. 

Before I discuss these problems, I will briefly review the theory behind the description of phase 
and phase measurements. 



1.1 The Description of Phase 

Classically there is no ambiguity in the definition of phase. A general propagating sinusoidal wave 
can be described by 

$$\psi(x,t) = A\sin(kx - \omega t + \phi), \tag{1.1}$$

where $A$ is the amplitude and $\phi$ is the phase. The phase is not a quantity that can be measured
directly; however, it is just a real number and can be determined unambiguously from the variation
of $\psi(x,t)$ (provided the amplitude is constant and nonzero).

For a monochromatic electromagnetic field propagating in the $\mathbf{k}$ direction, we can express the
electric field as

$$\mathbf{E}(\mathbf{R},t) = \mathbf{E}\sin(\mathbf{k}\cdot\mathbf{R} - \omega t + \phi). \tag{1.2}$$

In quantising the electric field in the Heisenberg picture, the operator for the electric field is

$$\hat{\mathbf{E}}(\mathbf{R},t) = \sqrt{\frac{\hbar\omega}{2\epsilon_0 V}}\left(\mathbf{e}\,e^{i(\mathbf{k}\cdot\mathbf{R}-\omega t)}a + \mathbf{e}^*e^{-i(\mathbf{k}\cdot\mathbf{R}-\omega t)}a^\dagger\right), \tag{1.3}$$

where $\hbar$ is Planck's constant divided by $2\pi$, $\epsilon_0$ is the dielectric permittivity for a vacuum, $V$ is
the quantisation volume, $\mathbf{e}$ is the polarisation vector and $a$ and $a^\dagger$ are the annihilation and creation
operators respectively. The variation of the electric field in time and space, as well as the polarisation
direction are contained in this field operator, but the amplitude and phase of the field are contained
in the state.

The annihilation and creation operators have the commutation relations 



$$[a, a^\dagger] = 1, \tag{1.4}$$

$$[a, a] = [a^\dagger, a^\dagger] = 0. \tag{1.5}$$

I will generally use the hat over variables to indicate operators when they are used in both an
operator and non-operator sense (like $\hat{E}$), but not when the variable is used only as an operator (like
$a$).

The quantum mechanical description of phase was first considered by London and Dirac.
In general, in quantum mechanics we wish to represent physical quantities by Hermitian operators.
Unfortunately, this approach produces difficulties when applied to phase. For example, consider the
result for a coherent state

$$\langle a\rangle = e^{i\phi}\sqrt{\bar{N}}. \tag{1.6}$$

From this, an obvious way of defining an operator for the phase is by

$$a = e^{i\hat\phi}\sqrt{\hat{N}}, \tag{1.7}$$

where $\hat{N}$ is the usual operator for the photon number,

$$\hat{N} = a^\dagger a. \tag{1.8}$$

The phase operator defined in this way is equivalent to that considered by Dirac. It is easily seen
that the phase operator defined in this way is not Hermitian.

This definition of the phase operator produces other problems. For example, Dirac derives the
commutation relation

$$[\hat{N}, \hat\phi] = i. \tag{1.9}$$

This would seem to imply the uncertainty relation [3]

$$\Delta N\,\Delta\phi \ge \frac{1}{2}. \tag{1.10}$$

This uncertainty relation does not make sense, because the uncertainty in $N$ can become arbitrarily
small, whereas the uncertainty in $\phi$ cannot exceed $\pi$. In fact, this uncertainty relation is not
necessarily implied by Eq. (1.9), because the phase operator is not Hermitian. Another problem is
that, for a number state, the expectation value of the commutator should be zero rather than $i$.

One resolution of this problem is that although it is not possible to define a Hermitian phase
operator, it is possible to define Hermitian sine and cosine operators that give valid uncertainty
relations. These Susskind-Glogower operators satisfy the commutation relations

$$[\widehat{\cos\phi}, \hat{N}] = i\,\widehat{\sin\phi}, \tag{1.11}$$

$$[\widehat{\sin\phi}, \hat{N}] = -i\,\widehat{\cos\phi}. \tag{1.12}$$

The corresponding uncertainty relations are

$$\Delta N\,\Delta\cos\phi \ge \frac{1}{2}\left|\langle\widehat{\sin\phi}\rangle\right|, \tag{1.13}$$

$$\Delta N\,\Delta\sin\phi \ge \frac{1}{2}\left|\langle\widehat{\cos\phi}\rangle\right|. \tag{1.14}$$

These uncertainty relations make sense, because in the limit of small number uncertainty the expectation
values on the right hand side also become small, so these uncertainty relations do not imply
ridiculously large values of $\Delta\cos\phi$ or $\Delta\sin\phi$.
Unfortunately there are also problems with these operators. For example, we find that for the
vacuum state $\langle\cos^2\phi\rangle = \frac{1}{4}$. As the vacuum state should have a uniform phase distribution, we
would expect that $\langle\cos^2\phi\rangle$ is equal to $\frac{1}{2}$. In addition, using these operators we find $\langle\exp(ip\phi)\rangle = 0$
for all integers $p > 0$. This implies a uniform phase distribution, which is what we would expect,
but is inconsistent with the result obtained for $\langle\cos^2\phi\rangle$.
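
The vacuum result can be checked numerically in a truncated number basis. The following is a minimal sketch, assuming the usual number-basis construction of the Susskind-Glogower cosine operator, $(E + E^\dagger)/2$ with $E = \sum_n |n\rangle\langle n+1|$, which is not written out explicitly in the text. The function names and the truncation dimension `dim` are illustrative choices only.

```python
def sg_cos_matrix(dim):
    """Truncated Susskind-Glogower cosine operator in the number basis.

    Assumed construction: cos = (E + E^dagger) / 2 with E = sum_n |n><n+1|.
    """
    cos = [[0.0] * dim for _ in range(dim)]
    for n in range(dim - 1):
        cos[n][n + 1] = 0.5  # contribution from E
        cos[n + 1][n] = 0.5  # contribution from E^dagger
    return cos


def expectation_of_square(op, state):
    """<state| op^2 |state> for a real symmetric op and a real state vector."""
    dim = len(state)
    op_state = [sum(op[i][j] * state[j] for j in range(dim)) for i in range(dim)]
    return sum(x * x for x in op_state)


dim = 10  # truncation dimension (illustrative)
cos_op = sg_cos_matrix(dim)

vacuum = [1.0] + [0.0] * (dim - 1)
number_three = [0.0] * dim
number_three[3] = 1.0

cos2_vacuum = expectation_of_square(cos_op, vacuum)
cos2_three = expectation_of_square(cos_op, number_three)
print(cos2_vacuum, cos2_three)
```

Under these assumptions the vacuum gives 1/4, whereas a number state with at least one photon (here three photons) gives 1/2, the value expected for a uniform phase distribution.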

Another approach to finding a Hermitian phase operator is the Pegg-Barnett formalism [9, 10, 11].
The basis of this formalism is to put an upper limit $s$ on the photon number, then take the limit as
$s$ tends to infinity. The reference phase states are taken to be

$$|\theta_m\rangle_s = (s+1)^{-1/2}\sum_{n=0}^{s}\exp(in\theta_m)|n\rangle, \tag{1.15}$$

where

$$\theta_m = \theta_0 + \frac{2\pi m}{s+1}, \qquad m = 0, 1, \ldots, s \tag{1.16}$$

and $\theta_0$ is an arbitrary constant.

Then a Hermitian phase operator is defined by

$$\hat\phi_s = \sum_{m=0}^{s}\theta_m\,|\theta_m\rangle_{s\,s}\langle\theta_m|. \tag{1.17}$$

Note that for this operator the phase states $|\theta_m\rangle_s$ are clearly eigenstates with eigenvalues of $\theta_m$. In
terms of the number basis, the operator is

$$\hat\phi_s = \theta_0 + \frac{s\pi}{s+1} + \frac{2\pi}{s+1}\sum_{j\ne k}\frac{\exp[i(j-k)\theta_0]\,|j\rangle\langle k|}{\exp[i(j-k)2\pi/(s+1)] - 1}. \tag{1.18}$$

In the Pegg-Barnett formalism, the limit of $s \to \infty$ is taken after expectation values have been
determined.

It is also possible to take the limit $s \to \infty$ of Eq. (1.18), which gives

$$\hat\phi = \theta_0 + \pi + \sum_{j\ne k}\frac{\exp[i(j-k)\theta_0]\,|j\rangle\langle k|}{i(j-k)}. \tag{1.19}$$

This operator was also considered before the Pegg-Barnett formalism was developed [12, 13, 14].
This would appear to be a Hermitian operator that can be used to describe phase; however, there
are problems with this operator. This operator leads to different expectation values than given by
the Pegg-Barnett formalism (where the expectation values are taken before the limit $s \to \infty$).

The problem is that (1.18) converges to (1.19) only weakly [15]. The weak limits of operators do
not preserve the operator algebra; for example, the weak limit of $\hat\phi_s^2$ is not $\hat\phi^2$. One result of this
is that, for the vacuum state, $\langle\hat\phi^2\rangle = \pi^2/6$ [16, 17]. For a uniform phase distribution, the result
should be $\pi^2/3$, which is that obtained from the limit of $\langle\hat\phi_s^2\rangle$.
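
The two vacuum values can be reproduced with a few lines of Python. This is only a sketch under stated assumptions: the reference phase is taken as $\theta_0 = -\pi$ (so that the phase window is $[-\pi, \pi)$), the finite-$s$ value uses the fact that the vacuum overlaps equally with every phase state of Eq. (1.15), and the limiting-operator value uses the vacuum matrix elements of Eq. (1.19), which have modulus $1/k$. The function names and cutoffs are illustrative.

```python
import math


def pegg_barnett_vacuum_phi2(s, theta0=-math.pi):
    """<phi^2> for the vacuum with the (s+1)-dimensional operator of Eq. (1.17).

    From Eq. (1.15), |<theta_m|0>|^2 = 1/(s+1) for every m, so the expectation
    value is just the average of theta_m^2 over the s+1 phase states.
    """
    thetas = [theta0 + 2 * math.pi * m / (s + 1) for m in range(s + 1)]
    return sum(t * t for t in thetas) / (s + 1)


def limit_operator_vacuum_phi2(kmax):
    """<0|phi^2|0> for the operator of Eq. (1.19), assuming theta0 = -pi.

    The constant term theta0 + pi vanishes, and |<k|phi|0>| = 1/k for k >= 1,
    so the expectation value is the sum of 1/k^2 (truncated at kmax).
    """
    return sum(1.0 / (k * k) for k in range(1, kmax + 1))


expectation_then_limit = pegg_barnett_vacuum_phi2(1000)
limit_then_expectation = limit_operator_vacuum_phi2(100000)
print(expectation_then_limit, math.pi ** 2 / 3)
print(limit_then_expectation, math.pi ** 2 / 6)
```

Taking the expectation value before the limit approaches $\pi^2/3$, whereas using the limiting operator directly approaches $\pi^2/6$, which illustrates why the order of the two steps matters.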

Fortunately, in this study we do not require an explicit phase operator, and usually all we need 
to know is the phase variance, or at most the probability distribution. For this what we want is the 
probability operator measure, or POM. 



1.2 Probability Operator Measures 

In quantum mechanical systems, the most general way of obtaining the probability of some measurement
result $E$ is by the expectation value of an operator $F(E)$, i.e.

$$P(E) = \mathrm{Tr}[\rho F(E)], \tag{1.20}$$

where $\rho$ is the state matrix for the system. If the set of all possible measurement results is $\Omega$, it is
evident that $P(\Omega) = 1$ for all $\rho$, which implies that $F(\Omega) = 1$. Thus $F(E)$ can be called a probability
operator, and the mapping $E \mapsto F(E)$ defines a probability operator measure (POM), sometimes also
called a positive-operator-valued measure (POVM), on $\Omega$ [18]. This method does not require a
specific operator to represent the quantity that is being measured.

This is quite different from the simple method for pure states, where the probability is given by the
squared modulus of the inner product between the initial and final states. To see the similarity between the
two methods, recall that in the simple method we represent the physical quantity being measured
by a Hermitian operator, for example $\hat{R}$. This has associated eigenvalues and eigenvectors $r$ and
$|\psi_r\rangle$. After a measurement yielding the result $r$ the system is in state $|\psi_r\rangle$, and the probability for
this result is $|\langle\psi_r|\psi\rangle|^2$.

Alternatively we can define the projection operators

$$\Pi_r = |\psi_r\rangle\langle\psi_r|. \tag{1.21}$$

The probability of obtaining the result $r$ is then

$$\begin{aligned}
P_r &= |\langle\psi_r|\psi\rangle|^2 \\
&= \langle\psi|\psi_r\rangle\langle\psi_r|\psi\rangle \\
&= \langle\psi|\Pi_r|\psi\rangle.
\end{aligned} \tag{1.22}$$

The normalised state after the measurement is then

$$|\psi_r\rangle = \frac{\Pi_r|\psi\rangle}{\sqrt{P_r}}. \tag{1.23}$$

For more general measurements, we can replace the projection operator with a more general
operator $\Omega_r$, called the measurement operator. Then the probability is given by

$$P_r = \langle\psi|\Omega_r^\dagger\Omega_r|\psi\rangle. \tag{1.24}$$

Note that in this case $\Omega_r^\dagger\Omega_r$ does not necessarily simplify to $\Omega_r$, as is the case for projection operators.
The normalised state after the measurement is given by

$$|\psi_r\rangle = \frac{\Omega_r|\psi\rangle}{\sqrt{P_r}}. \tag{1.25}$$

For the most general case we replace $\Omega_r^\dagger\Omega_r$ in Eq. (1.24) [or $\Pi_r$ in Eq. (1.22)] with a general
operator $F_r$. The probability is then determined using

$$P_r = \langle\psi|F_r|\psi\rangle. \tag{1.26}$$

Clearly, the generalisation of this for mixed states is Eq. (1.20).

For phase measurements, the probability distribution for the measurement result can be determined
using the POM $F(\phi)$. This approach was first considered by Helstrom, and is also
considered in Refs [20, 21]. If the phase measurement treats all phases equally, it should be invariant
under a phase translation

$$R(\theta)F(\phi)R(-\theta) = F(\phi + \theta), \tag{1.27}$$

where $R(\theta) = \exp(ia^\dagger a\,\theta)$ is the phase translation operator. Now $F(\phi)$ has the general expansion

$$F(\phi) = \sum_{n,m=0}^{\infty}|n\rangle\langle m|\,F_{nm}(\phi). \tag{1.28}$$

The phase shift invariance condition gives

$$F(\phi + \theta) = \sum_{n,m=0}^{\infty}e^{ia^\dagger a\theta}|n\rangle\langle m|e^{-ia^\dagger a\theta}F_{nm}(\phi), \tag{1.29}$$

$$\sum_{n,m=0}^{\infty}|n\rangle\langle m|\,F_{nm}(\phi + \theta) = \sum_{n,m=0}^{\infty}e^{in\theta}|n\rangle\langle m|e^{-im\theta}F_{nm}(\phi). \tag{1.30}$$
This means that we must have

$$F_{nm}(\phi + \theta) = e^{i(n-m)\theta}F_{nm}(\phi). \tag{1.31}$$

This implies that $F_{nm}(\phi)$ must have the form

$$F_{nm}(\phi) = \frac{1}{2\pi}e^{i(n-m)\phi}H_{nm}. \tag{1.32}$$

A factor of $1/(2\pi)$ has been added for normalisation. Therefore the general form of $F(\phi)$ for a shift
invariant phase measurement is

$$F(\phi) = \frac{1}{2\pi}\sum_{n,m=0}^{\infty}|n\rangle\langle m|\,e^{i(n-m)\phi}H_{nm}. \tag{1.33}$$

For the integral of the probabilities to equal 1, we must have

$$\int_{-\pi}^{\pi}F(\phi)\,d\phi = 1. \tag{1.34}$$

Applying this to Eq. (1.33) above we find that

$$\sum_{n=0}^{\infty}|n\rangle\langle n|\,H_{nn} = 1. \tag{1.35}$$

This means that the diagonal elements $H_{nn}$ must all be equal to 1.

In addition there is the condition that the probability given by $\mathrm{Tr}[\rho F(\phi)]$ always be real and
positive. This, together with the above result, means that all of the $H_{nm}$ must have absolute values
between 0 and 1. Usually these are all assumed to be real and positive; however, we will see
later that they need not be.

In Ref. [23] it is shown that the additional condition that a number shifter does not alter the
phase distribution gives $H_{nm} = 1$, corresponding to the POM

$$F_{\rm can}(\phi) = \frac{1}{2\pi}\sum_{n,m=0}^{\infty}e^{i(n-m)\phi}|n\rangle\langle m|. \tag{1.36}$$

This POM corresponds to a canonical phase measurement |2^] . Ref. [EJ also derives this POM using 
the maximum likelihood approach. In general, real measurements will give smaller values of H nm , 
and the closer these are to 1 the better the phase measurement is. This POM is the best possible 
for the main states that I will be considering in this thesis, but not for every possible input state. The
best POM as derived in [21] is actually dependent on the input state; this is discussed further
later in the thesis.

Note that we may express the POM (1.36) in the form

$$F_{\rm can}(\phi) = |\phi\rangle\langle\phi|, \tag{1.37}$$

where

$$|\phi\rangle = (2\pi)^{-1/2}\sum_{n=0}^{\infty}e^{in\phi}|n\rangle. \tag{1.38}$$

These states are eigenstates of the Susskind-Glogower operators $\widehat{\cos\phi}$ and $\widehat{\sin\phi}$, and may be interpreted
as phase states. We therefore see that the POM of Eq. (1.36) is consistent with the
Susskind-Glogower formalism. It can also be shown [23] that identical results are obtained using
this POM as using the Pegg-Barnett formalism. In addition, London's treatment of phase is
also equivalent to this.
Figure 1.1: A phase probability distribution that is narrowly peaked at $\pm\pi$.



The fact that these different approaches to phase give equivalent results is a compelling reason
to consider this to be an accurate description of phase. Nevertheless, there is another class of
descriptions of phase which is not equivalent to this: those based on an 'operational' approach
[24, 25, 26, 27, 28, 29]. Here the phase is defined as the quantity measured by a particular experiment.
The disadvantage of this approach is that the description of phase is dependent on the experiment.
For further discussion of the problems involved in the description of phase, see Refs [30, 31].



1.3 Phase Variance 

In this work we are primarily interested in the phase variance, and not the total phase distribution.
Since phase is a cyclic variable the usual definition of phase variance, as given by

$$\mathrm{var}(\phi) = \langle\phi^2\rangle - \langle\phi\rangle^2, \tag{1.39}$$

does not work well. For example, we would usually expect the variance to go to infinity in the limit
of a flat distribution. For phase, as the distribution is limited to a region of length $2\pi$, the variance
will be finite for a flat distribution. This means that it is not possible to give an uncertainty relation
in the usual way. In addition, if the mean of the distribution is at one bound of the phase, the phase
variance obtained from this definition will be artificially large. For example, the distribution shown
in Fig. 1.1 is narrowly peaked, but Eq. (1.39) will give a very large variance.

These problems can be solved by using the Holevo phase variance [32]

$$V(\phi) = |\langle e^{i\phi}\rangle|^{-2} - 1. \tag{1.40}$$

This definition is naturally modulo $2\pi$, and in addition if the distribution is flat then $\langle e^{i\phi}\rangle$ will be
zero, so the variance will be infinite. For this definition of the variance, there is the uncertainty
relation

$$V(\phi)\,\Delta N^2 \ge \frac{1}{4}. \tag{1.41}$$

In contrast, using Eq. (1.39) for the phase variance there is no uncertainty relation of this form
because, for example, a number state has zero number variance but a finite variance under Eq. (1.39).
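Also, for a distribution that is sharply peaked, we find

$$\begin{aligned}
V(\phi) &= |\langle e^{i\phi}\rangle|^{-2} - 1 \\
&= |\langle e^{i(\phi - \langle\phi\rangle)}\rangle|^{-2} - 1 \\
&\approx \left|\left\langle 1 + i(\phi - \langle\phi\rangle) - \tfrac{1}{2}(\phi - \langle\phi\rangle)^2\right\rangle\right|^{-2} - 1 \\
&= \left|1 - \tfrac{1}{2}\langle(\phi - \langle\phi\rangle)^2\rangle\right|^{-2} - 1 \\
&= \left[1 - \tfrac{1}{2}\langle(\phi - \langle\phi\rangle)^2\rangle\right]^{-2} - 1 \\
&\approx 1 + \langle(\phi - \langle\phi\rangle)^2\rangle - 1 \\
&= \langle(\phi - \langle\phi\rangle)^2\rangle.
\end{aligned} \tag{1.42}$$

Therefore this measure of the phase variance is approximately the same as the usual estimate of the
variance for sharply peaked distributions.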
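
The behaviour of the two definitions can be compared numerically. The sketch below is illustrative only: it discretises the interval $[-\pi, \pi)$ on a uniform grid, uses a wrapped Gaussian-shaped peak of width 0.1 as a stand-in for a narrowly peaked distribution (an assumption, not a distribution taken from the text), and evaluates Eqs (1.39) and (1.40) for a peak at 0, a peak at $\pm\pi$ as in Fig. 1.1, and a flat distribution. The function names and the small threshold used to detect a flat distribution are arbitrary choices.

```python
import cmath
import math


def peaked_distribution(centre, width, num=2000):
    """Grid points on [-pi, pi) and normalised weights for a narrow
    Gaussian-shaped peak at `centre`, wrapped onto the circle."""
    phis = [-math.pi + 2 * math.pi * (k + 0.5) / num for k in range(num)]
    weights = []
    for phi in phis:
        # Shortest angular distance from the peak, in (-pi, pi].
        d = cmath.phase(cmath.exp(1j * (phi - centre)))
        weights.append(math.exp(-0.5 * (d / width) ** 2))
    total = sum(weights)
    return phis, [w / total for w in weights]


def standard_variance(phis, weights):
    """Eq. (1.39): <phi^2> - <phi>^2 on the interval [-pi, pi)."""
    mean = sum(w * p for p, w in zip(phis, weights))
    return sum(w * p * p for p, w in zip(phis, weights)) - mean ** 2


def holevo_variance(phis, weights):
    """Eq. (1.40): |<e^{i phi}>|^-2 - 1, infinite for a flat distribution."""
    sharpness = abs(sum(w * cmath.exp(1j * p) for p, w in zip(phis, weights)))
    if sharpness < 1e-12:  # numerical zero, treated as a flat distribution
        return math.inf
    return sharpness ** -2 - 1


phis, at_zero = peaked_distribution(0.0, 0.1)
_, at_pi = peaked_distribution(math.pi, 0.1)
flat = [1.0 / len(phis)] * len(phis)

for name, weights in (("peak at 0", at_zero), ("peak at pi", at_pi), ("flat", flat)):
    print(name, standard_variance(phis, weights), holevo_variance(phis, weights))
```

For the peak at 0 the two measures nearly coincide, as Eq. (1.42) suggests. For the peak at $\pm\pi$ the standard variance is close to $\pi^2$ while the Holevo variance is unchanged, and for the flat distribution the standard variance stays finite while the Holevo variance is infinite.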

The definition of the Holevo variance (1.40) can be used when the exact phase distribution is
known. In obtaining numerical results, it is generally the case that the exact distribution is
unknown, so the Holevo phase variance must be estimated from a set of samples. In the case of the
standard variance, it is a standard result in statistics that the variance calculated from the samples
by

$$\frac{1}{M}\sum_{i=1}^{M}(\phi_i - \bar\phi)^2, \tag{1.43}$$

where $M$ is the number of samples and $\bar\phi$ is the sample mean, is a biased estimator for the variance of the distribution. The
unbiased estimator is where the dividing factor is $M - 1$, i.e. the unbiased estimator is

$$\frac{1}{M-1}\sum_{i=1}^{M}(\phi_i - \bar\phi)^2. \tag{1.44}$$

This is because taking the average of the data removes one degree of freedom. If the mean of the
distribution is known, then an unbiased estimator for the variance is

$$\frac{1}{M}\sum_{i=1}^{M}(\phi_i - \langle\phi\rangle)^2, \tag{1.45}$$

where $\langle\phi\rangle$ is the known mean for the distribution. If the measurements are unbiased, then the mean
for the distribution will be the same as the actual phase, $\varphi$. If we use the actual phase, then the
estimator becomes

$$\frac{1}{M}\sum_{i=1}^{M}(\phi_i - \varphi)^2. \tag{1.46}$$

The situation is analogous for the Holevo phase variance. If the measurements are unbiased, we
can estimate the Holevo variance from the samples by

$$\left[\mathrm{Re}\left(\frac{1}{M}\sum_{i=1}^{M}e^{i(\phi_i - \varphi)}\right)\right]^{-2} - 1. \tag{1.47}$$

It is easy to see that when the phase distribution is narrowly peaked, this simplifies to Eq. (1.46).
On the other hand, if the measurements are biased, then this will not be an estimator for the Holevo
variance, and Eq. (1.46) will not be an estimator for the standard variance.
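
The sample estimators of Eqs (1.46) and (1.47) are straightforward to code. The following sketch is hedged in the obvious ways: the samples are synthetic Gaussian draws around an assumed true phase (they do not come from any measurement scheme in this thesis), the spread is small enough that wrapping is ignored, and the function names, seed and sample size are illustrative.

```python
import cmath
import random


def holevo_variance_estimate(samples, true_phase):
    """Eq. (1.47): [Re (1/M) sum_i exp(i (phi_i - varphi))]^-2 - 1."""
    mean_phasor = sum(cmath.exp(1j * (s - true_phase)) for s in samples) / len(samples)
    return mean_phasor.real ** -2 - 1


def standard_variance_estimate(samples, true_phase):
    """Eq. (1.46): (1/M) sum_i (phi_i - varphi)^2."""
    return sum((s - true_phase) ** 2 for s in samples) / len(samples)


rng = random.Random(1)  # fixed seed so the run is repeatable
true_phase = 0.3
unbiased = [rng.gauss(true_phase, 0.1) for _ in range(20000)]
biased = [s + 0.2 for s in unbiased]  # same spread, shifted by a constant bias

v_holevo = holevo_variance_estimate(unbiased, true_phase)
v_standard = standard_variance_estimate(unbiased, true_phase)
v_holevo_biased = holevo_variance_estimate(biased, true_phase)
print(v_holevo, v_standard, v_holevo_biased)
```

For the narrowly peaked unbiased samples both estimators return values near the underlying variance of 0.01. For the shifted samples, measuring the deviation from the actual phase gives a much larger value, because the bias is counted as error.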

For measurements that may be biased the variance is not a good measure of the accuracy of the
measurement. An arbitrary bias may be added to the distribution without altering the variance, as
the deviation is measured from the average of the distribution, rather than the actual phase. For a
biased measurement scheme, it is more appropriate to define the standard variance as

$$\mathrm{var}_\varphi(\phi) = \langle(\phi - \varphi)^2\rangle. \tag{1.48}$$

Here the subscript $\varphi$ has been used to distinguish this variance from the usual definition. For
this definition, biased distributions will give larger variances. Also Eq. (1.46) will be an unbiased
estimator for this modified variance.

The analogous definition for the Holevo variance is

$$V_\varphi(\phi) = \left[\mathrm{Re}\langle e^{i(\phi - \varphi)}\rangle\right]^{-2} - 1. \tag{1.49}$$

In the case that $\varphi = 0$, or $\phi$ is the deviation from the system phase, this definition of the variance
simplifies to

$$V_\varphi(\phi) = \left[\mathrm{Re}\langle e^{i\phi}\rangle\right]^{-2} - 1. \tag{1.50}$$

Similarly to the case for the standard variance, this definition will give larger variances for biased
distributions, whereas the standard definition does not distinguish biased and unbiased distributions.
If the measurements are biased, then Eq. (1.47) will be an estimator for this variance, but not the
Holevo variance as given by Eq. (1.40).

In this thesis I usually consider phase distributions that are unbiased, so this modified definition
of the Holevo phase variance is not required. For the problem of finding optimal phase estimates,
however, this altered definition is necessary in order to eliminate the possibility of biased phase
estimates.

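Now if we consider arbitrary measurements on an arbitrary pure state $|\psi\rangle$, the probability distribution
for the phase is given by

$$\begin{aligned}
P(\phi) &= \mathrm{Tr}[\rho F(\phi)] \\
&= \frac{1}{2\pi}\sum_{n,m=0}^{\infty}\langle\psi|n\rangle\langle m|\psi\rangle\,e^{i(n-m)\phi}H_{nm}.
\end{aligned} \tag{1.51}$$

In order to determine the Holevo phase variance we must determine $\langle e^{i\phi}\rangle$. Evaluating this gives

$$\begin{aligned}
\langle e^{i\phi}\rangle &= \int_{-\pi}^{\pi}e^{i\phi}P(\phi)\,d\phi \\
&= \int_{-\pi}^{\pi}\frac{1}{2\pi}\sum_{n,m=0}^{\infty}\langle\psi|n\rangle\langle m|\psi\rangle\,e^{i(n+1-m)\phi}H_{nm}\,d\phi \\
&= \sum_{n,m=0}^{\infty}\langle\psi|n\rangle\langle m|\psi\rangle\,\delta_{n+1,m}H_{nm} \\
&= \sum_{n=0}^{\infty}\langle\psi|n\rangle\langle n+1|\psi\rangle\,H_{n,n+1}.
\end{aligned} \tag{1.52}$$

Therefore the phase variance does not depend on all the elements $H_{nm}$, but only on the off-diagonal
elements $H_{n,n+1}$. As we are often only interested in the phase variance resulting from a measurement
scheme, this greatly reduces the number of variables required to describe the measurement. Note
that the condition $|H_{nm}| \le 1$ means that this expression gives $|\langle e^{i\phi}\rangle| \le 1$, as we would expect.
For canonical measurements we have $H_{n,n+1} = 1$, so this simplifies to

$$\langle e^{i\phi}\rangle_{\rm can} = \sum_{n=0}^{\infty}\langle\psi|n\rangle\langle n+1|\psi\rangle. \tag{1.53}$$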
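
As a consistency check on Eq. (1.53), the sum can be compared with a direct numerical integration of $e^{i\phi}$ against the canonical distribution, which from Eq. (1.36) is $P(\phi) = |\sum_n \langle n|\psi\rangle e^{-in\phi}|^2/(2\pi)$ for a pure state. The sketch below is illustrative: the coherent state with real amplitude 2, the truncation at 40 photons, the grid size and all function names are assumptions made for the example.

```python
import cmath
import math


def coherent_amplitudes(alpha, nmax):
    """Number-state amplitudes <n|alpha> for real alpha > 0, truncated at nmax."""
    amps = []
    for n in range(nmax + 1):
        log_amp = -0.5 * alpha ** 2 + n * math.log(alpha) - 0.5 * math.lgamma(n + 1)
        amps.append(math.exp(log_amp))
    return amps


def sharpness_from_sum(amps):
    """Eq. (1.53): <e^{i phi}>_can = sum_n <psi|n><n+1|psi>."""
    return sum(amps[n].conjugate() * amps[n + 1] for n in range(len(amps) - 1))


def canonical_distribution(amps, phi):
    """P(phi) for the canonical POM of Eq. (1.36) and a pure state."""
    amp = sum(a * cmath.exp(-1j * n * phi) for n, a in enumerate(amps))
    return abs(amp) ** 2 / (2 * math.pi)


def sharpness_from_integral(amps, num=256):
    """Integrate e^{i phi} P(phi) over [-pi, pi) on a uniform grid."""
    dphi = 2 * math.pi / num
    total = 0j
    for k in range(num):
        phi = -math.pi + k * dphi
        total += cmath.exp(1j * phi) * canonical_distribution(amps, phi) * dphi
    return total


amps = coherent_amplitudes(2.0, 40)
from_sum = sharpness_from_sum(amps)
from_integral = sharpness_from_integral(amps)
print(from_sum, from_integral)
print("canonical Holevo variance:", abs(from_sum) ** -2 - 1)
```

The two values agree to rounding error, since the uniform grid integrates the truncated trigonometric polynomial exactly once the number of grid points exceeds the highest frequency present.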
For more arbitrary measurements, we will have $H_{n,n+1} < 1$, so it is convenient to define the vector
$h(n)$ by

$$h(n) = 1 - H_{n,n+1}. \tag{1.54}$$

In terms of this we obtain

$$\begin{aligned}
\langle e^{i\phi}\rangle &= \sum_{n=0}^{\infty}\langle\psi|n\rangle\langle n+1|\psi\rangle\,(1 - h(n)) \\
&= \sum_{n=0}^{\infty}\langle\psi|n\rangle\langle n+1|\psi\rangle - \sum_{n=0}^{\infty}\langle\psi|n\rangle\langle n+1|\psi\rangle\,h(n) \\
&= \langle e^{i\phi}\rangle_{\rm can} - \sum_{n=0}^{\infty}\langle\psi|n\rangle\langle n+1|\psi\rangle\,h(n).
\end{aligned} \tag{1.55}$$

If the photon number distribution is reasonably sharply peaked then we can typically replace $h(n)$
by its value for the average photon number. Then we find

$$\langle e^{i\phi}\rangle \approx \langle e^{i\phi}\rangle_{\rm can}\,(1 - h(\bar{n})). \tag{1.56}$$

For large mean photon number, we find that $h(\bar{n}) \ll 1$ for most measurement schemes. Using this
approximation, the Holevo phase variance is

$$\begin{aligned}
V(\phi) &\approx \left|\langle e^{i\phi}\rangle_{\rm can}\,(1 - h(\bar{n}))\right|^{-2} - 1 \\
&= \left|\langle e^{i\phi}\rangle_{\rm can}\right|^{-2}(1 - h(\bar{n}))^{-2} - 1 \\
&\approx \left|\langle e^{i\phi}\rangle_{\rm can}\right|^{-2}(1 + 2h(\bar{n})) - 1 \\
&\approx \left|\langle e^{i\phi}\rangle_{\rm can}\right|^{-2} + 2h(\bar{n}) - 1 \\
&= V_{\rm can}(\phi) + 2h(\bar{n}).
\end{aligned} \tag{1.57}$$

Thus we see that to a first approximation the phase variance is equal to the intrinsic phase variance
plus twice the value of $h(n)$ for the average photon number. This means that the introduced phase
variance is fairly independent of the input state (apart from the photon number).

For many types of real measurements, the value of $h(n)$ decreases as some power of $n$ for large
$n$. We can therefore approximate it by

$$h(n) \approx cn^{-p}. \tag{1.58}$$

When this is the case, we can describe the measurement by just the two variables $c$ and $p$, rather
than the entire vector $h(n)$, or the matrix $H_{nm}$.
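
The quality of the approximation in Eq. (1.57) can be gauged numerically. The sketch below is only indicative: it takes a coherent state with mean photon number 100, uses the power law of Eq. (1.58) for every $n \ge 1$ with the values of $c$ and $p$ listed in Table 1.1, and clamps $h(n)$ to at most 1 with $h(0) = 1$. The clamp, the truncation at 400 photons and the function names are assumptions of this example, and the power law is itself only a large-$n$ approximation to the actual $h(n)$ of each scheme.

```python
import math


def coherent_amplitudes(mean_n, nmax):
    """Real number-state amplitudes of a coherent state with mean photon number mean_n."""
    return [math.exp(0.5 * (-mean_n + n * math.log(mean_n) - math.lgamma(n + 1)))
            for n in range(nmax + 1)]


def h_power_law(n, c, p):
    """Eq. (1.58), h(n) = c n^-p, clamped to at most 1 (the clamp is an assumption)."""
    if c == 0:
        return 0.0  # canonical measurement, H_{n,n+1} = 1
    if n == 0:
        return 1.0
    return min(1.0, c * n ** -p)


def holevo_variance(amps, c=0.0, p=1.0):
    """Holevo variance from Eq. (1.52) with H_{n,n+1} = 1 - h(n)."""
    sharpness = sum(amps[n] * amps[n + 1] * (1.0 - h_power_law(n, c, p))
                    for n in range(len(amps) - 1))
    return sharpness ** -2 - 1


mean_n = 100.0
amps = coherent_amplitudes(mean_n, 400)
v_can = holevo_variance(amps)

# (c, p) pairs as listed in Table 1.1.
schemes = {"heterodyne": (1 / 8, 1.0), "mark I": (1 / 8, 0.5), "mark II": (1 / 16, 1.5)}
comparison = {}
for name, (c, p) in schemes.items():
    exact = holevo_variance(amps, c, p)           # from the sum in Eq. (1.55)
    approx = v_can + 2 * c * mean_n ** -p         # Eq. (1.57)
    comparison[name] = (exact, approx)
    print(name, exact, approx)
```

With these assumptions the approximate and summed values agree to within a few percent at this photon number, with the largest discrepancy for the scheme with the largest $h(\bar{n})$.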

1.4 Real Measurements 

In practice, it is not possible to directly measure phase. What we must do is infer the phase from
measurements of some other quantity. The electric field, with the operator given by Eq. (1.3), is
directly measurable. Ignoring the field direction, this operator is proportional to

$$\hat{X}_\Phi = ae^{-i\Phi} + a^\dagger e^{i\Phi}, \tag{1.59}$$

where

$$\Phi = \omega t - \mathbf{k}\cdot\mathbf{R}. \tag{1.60}$$

This is called the $\Phi$ quadrature operator. For high frequency light $\Phi$ varies far too rapidly for
convenience, so we wish to measure the quadrature in a more convenient way.

This can be done by combining the light with a strong local oscillator field at a beam splitter, as
in Fig. 1.2. The local oscillator field provides a reference phase. The difference between the photon
numbers at the two photodetectors gives a measurement of the quadrature $\hat{X}_\Phi$, where $\Phi$ is the phase
of the local oscillator field.

Figure 1.2: Diagram of the apparatus for making a dyne phase measurement. The signal from the
cavity and the local oscillator field are combined at a 50/50 beam splitter (BS) and the outputs are
detected by photodetectors (PD). For adaptive measurements the signals from these photodetectors
are processed by the digital signal processor, which determines a phase estimate and adjusts the
electro-optic phase modulator (EOM) accordingly.

In this study I consider measurements that take an appreciable time, so rather than just consider- 
ing the total photon counts at the detectors, it is possible to consider the instantaneous photocurrent. 
For this extended measurement we can alter the phase of the local oscillator during the measurement 
based on the data obtained so far. 

The standard technique for measuring a completely unknown phase is heterodyne detection, 
where the phase of the local oscillator is varied linearly. This is simple to do, as it simply means 
that the local oscillator has a slightly different frequency than the signal. This method suffers 
from the drawback that it introduces a fairly large phase uncertainty. It is possible to obtain more
accurate measurements if we use a constant feedback phase $\Phi$ that is close to the actual phase plus
$\pi/2$. This is called the homodyne technique.

The major problem with the homodyne technique is that it requires knowledge of the system 
phase beforehand. The guiding principle behind adaptive phase measurements is that we wish 
to approximate homodyne phase measurements by varying the local oscillator phase during the 
measurement based on data obtained so far. We will see that it is not quite so simple, however, as 
we do not use the best estimate of the phase, and in rare cases will not use a phase estimate at all. 
I will use the general term "dyne" for heterodyne and homodyne phase measurements, as well as 
these more complicated adaptive phase measurements. 

In order to describe the adaptive measurements I will first introduce the notation. The numbers
of detections at the two photodetectors in the time interval $\delta t$ are denoted $\delta N_+$ and $\delta N_-$. The
complex amplitude of the local oscillator will be denoted as $\gamma$. The difference photocurrent $I(t)$ is
then defined in terms of the noncommuting limits

$$I(t) = \lim_{\delta t\to 0}\,\lim_{|\gamma|\to\infty}\frac{\delta N_+ - \delta N_-}{|\gamma|\,\delta t}. \tag{1.61}$$

This simplifies to

$$I(t)\,dt = 2\mathrm{Re}\left(\langle a\rangle e^{-i\Phi}\right)dt + dW(t), \tag{1.62}$$

where $dW(t)$ is an infinitesimal Wiener increment with variance $dt$. For a constant amplitude
coherent state we have $\langle a\rangle = \alpha$. Note that $\langle I(t)\rangle = \langle\hat{X}_\Phi\rangle$, so this is essentially a measurement of the
$\Phi$ quadrature.
For more general states, $\langle a\rangle$ may have a systematic variation given by

$$\langle a\rangle = \alpha_t \propto \sqrt{u(t)}, \tag{1.63}$$

where $u(t)$ is a mode function. In this case we alter the definition of $I(t)$, and scale $\alpha_t$ and the time
to obtain

$$I(v)\,dv = 2\mathrm{Re}\left(\alpha_v e^{-i\Phi(v)}\right)dv + dW(v), \tag{1.64}$$

similarly to the case for constant amplitude. The detail of how this is done is given in a later chapter.
We then define the quantities $A_v$ and $B_v$ by

$$A_v = \int_0^v e^{i\Phi(u)}I(u)\,du, \tag{1.65}$$

$$B_v = -\int_0^v e^{2i\Phi(u)}\,du. \tag{1.66}$$

The time is scaled to the unit interval, and for the values of $A_v$ and $B_v$ at $v = 1$ the subscripts are
omitted. It is shown in Ref. [33] that the only relevant information from the measurement record is
contained in these two variables. ($B_v$ does not explicitly depend on the measurement record, but it
does depend on it implicitly if the feedback phase is varied based on the measurement record.)

To see why this is so, firstly recall that the way the normalised state varies for measurement
operator $\Omega_r$ is

$$|\psi_r\rangle = \frac{\Omega_r|\psi\rangle}{\sqrt{P_r}}. \tag{1.67}$$

Alternatively we can consider an unnormalised state $|\tilde\psi_r\rangle$, given by

$$|\tilde\psi_r\rangle = \frac{\Omega_r|\psi\rangle}{\sqrt{\Lambda_r}}, \tag{1.68}$$

where the $\Lambda_r$ are called the ostensible probabilities. The actual probability is then given by

$$P_r = \Lambda_r\langle\tilde\psi_r|\tilde\psi_r\rangle. \tag{1.69}$$

This method is used in Ref. [33] to consider the evolution of the signal state under dyne measurements,
with the ostensible probabilities chosen as those for a vacuum.

It is found that the evolution of the unnormalised state depends on the photocurrent record and
feedback phases only through the variables $A_v$ and $B_v$. This means that the probability distribution
for the photocurrent record up to time $v$, $I_{[0,v)}$, is given by

$$P(I_{[0,v)}) = P_0(I_{[0,v)})\,\langle\tilde\psi_v(A_v,B_v)|\tilde\psi_v(A_v,B_v)\rangle, \tag{1.70}$$

where $P_0(I_{[0,v)})$ is the ostensible probability distribution. Note that this does not prove that the
probability distribution for the photocurrent record only depends on $A_v$ and $B_v$, as the ostensible
probability distribution may depend on the photocurrent record in some more complicated way. The
ostensible distribution, however, is for a vacuum, and does not contain any information about the
state. This means that all the information about the state from the photocurrent record $I_{[0,v)}$ is
contained in $A_v$ and $B_v$.

To show this in a more rigorous way, consider some arbitrary parameter of the input state, $x$.
This could, for example, be the phase or the photon number. Using Bayes' theorem we find that
the probability distribution for this quantity given the photocurrent record $I_{[0,v)}$ is

$$P(x|I_{[0,v)}) = \frac{P(I_{[0,v)}|x)\,P(x)}{P(I_{[0,v)})}. \tag{1.71}$$

Here $P(x)$ is the probability distribution for this quantity at the start of the measurement. It
will be flat, as we are assuming that there is no knowledge about this parameter before the start
of the measurement. The probability in the denominator is independent of $x$, so the probability
distribution is therefore

$$P(x|I_{[0,v)}) \propto P(I_{[0,v)}|x). \tag{1.72}$$

The ostensible probability distribution $P_0(I_{[0,v)})$ contains no information about the state, and will
therefore be independent of $x$. Thus we find

$$P(x|I_{[0,v)}) \propto \langle\tilde\psi_v(A_v,B_v)|\tilde\psi_v(A_v,B_v)\rangle. \tag{1.73}$$

This means that the probability distribution for any parameter of the signal state only depends
on the photocurrent record through $A_v$ and $B_v$. This means, for example, that estimators for the
phase should be functions of $A_v$ and $B_v$. They can be functions of other variables, for example the
initial coherent amplitude, if that is known, but they should depend on the photocurrent record only
through $A_v$ and $B_v$.



1.5 Adaptive Phase Measurements 



For a coherent state with $\alpha = |\alpha|e^{i\varphi}$, Eq. (1.64) becomes

$$I(v)\,dv = 2|\alpha|\cos(\Phi - \varphi)\,dv + dW(v). \tag{1.74}$$

In order for the measurement to be close to an ideal measurement of the phase, the cos function
should be as close as possible to zero. This implies that its argument should be close to $\pm\pi/2$.
Therefore it is best to take the local oscillator phase to be $\hat\varphi + \pi/2$, where $\hat\varphi$ is some estimate of the
system phase. In that case

$$I(v)\,dv = 2|\alpha|\sin(\varphi - \hat\varphi)\,dv + dW(v). \tag{1.75}$$

In the limit that $|\hat\varphi - \varphi| \ll 1$, this becomes

$$I(v)\,dv \approx 2|\alpha|(\varphi - \hat\varphi)\,dv + dW(v), \tag{1.76}$$

so the measurement is very close to an ideal measurement of the phase. The reason there is
always an excess phase uncertainty is that the sin function is not completely linear, so this is not
exactly equivalent to a direct measurement of the phase.

The basis of homodyne measurements is to use an estimate of the phase that is known before the
measurement begins. For adaptive measurements, we only have information about the phase once
the measurement has begun, and we use a phase estimate based on the data obtained so far. To see
what we can use as a phase estimator, we can expand the variable $A_v$ to give

$$A_v = v\alpha - \alpha^*B_v + i\sigma_v, \tag{1.77}$$

where

$$\sigma_v = \int_0^v e^{i[\Phi(u) - \pi/2]}\,dW(u). \tag{1.78}$$

From this result it is simple to show that

$$vA_v + B_vA_v^* = \alpha\left(v^2 - |B_v|^2\right) + i\left(v\sigma_v - B_v\sigma_v^*\right). \tag{1.79}$$

Taking the expectation value gives

$$\langle vA_v + B_vA_v^*\rangle \approx \alpha\left(v^2 - |B_v|^2\right). \tag{1.80}$$
Table 1.1: The values of c and p for heterodyne, mark I and mark II measurements.

              c       p
heterodyne    1/8     1
mark I        1/8     0.5
mark II       1/16    1.5


If the local oscillator phase is independent of the photocurrent record, then this is exact. In the case
of feedback, $B_v$ may be correlated with $\sigma_v$, but this result should still be approximately true. This
means that the phase of $vA_v + B_vA_v^*$ should be close to the phase of the signal. It is convenient to
define the new variable

$$C_v = vA_v + B_vA_v^*. \tag{1.81}$$

This should not be confused with the variable C used in Ref. [Q, which is defined differently. 

For heterodyne measurements, the local oscillator phase is varying linearly. This means
that the average value of $e^{2i\Phi}$ will be zero, so $B_v$ will be close to zero. This means that $C_v \approx vA_v$,
so $\arg A_v$ will also be a good estimator of the phase. It is simple to show from this (3^] that the
phase variance for heterodyne measurements on coherent states is approximately $\frac{1}{2}|\alpha|^{-2}$. As the
intrinsic phase variance for coherent states is $\frac{1}{4}|\alpha|^{-2}$, this means that there is an introduced phase
variance of $\frac{1}{4}\bar{n}^{-1}$.

For more general measurements where $B_v \neq 0$, $\arg A_v$ will not be as good an estimator of the
phase; however, it can still be used as an estimator. If $\hat\varphi = \arg A_v$ is used as the estimator for the
phase in feedback to the local oscillator phase, we have a measurement scheme that can be analysed
analytically. In the scheme considered in Ref. [36], $\arg A$ was the phase estimate used at the end of
the measurement as well.

It was shown in Ref. [36] that these measurements give the canonical result if the system has
at most one photon. For systems with large numbers of photons, however, the introduced phase
variance scales as $\frac{1}{4}\bar{n}^{-1/2}$. This means that for large photon numbers this measurement scheme gives
far higher phase variances than heterodyne measurements.

It is possible to greatly improve on this result by simply using the best phase estimate at the end
of the measurement, $\arg C$. This is called the mark II adaptive phase measurement scheme, whereas
the scheme where $\arg A$ is used at the end of the measurement is called mark I. This measurement
scheme introduces a phase uncertainty of $\frac{1}{8}\bar{n}^{-3/2}$ [35]. This is a vast improvement over the mark I
scheme, and also improves on the heterodyne scheme.



Recall from Eq. (1.57) that the introduced phase variance is approximately $2h(\bar{n})$. This means
that the variation of $h(n)$ for the heterodyne, mark I and mark II measurement schemes is approximately

$$h_{\rm het}(n) \approx \tfrac{1}{8} n^{-1}, \tag{1.82}$$
$$h_{\rm I}(n) \approx \tfrac{1}{8} n^{-0.5}, \tag{1.83}$$
$$h_{\rm II}(n) \approx \tfrac{1}{16} n^{-1.5}. \tag{1.84}$$

To summarise this, the values of $c$ and $p$ for these three measurement schemes are as given in Table 1.1.
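As a rough numerical illustration of Eqs. (1.82) to (1.84), the following sketch evaluates the leading-order introduced phase variance $2h(n) \approx 2cn^{-p}$ for the three schemes, using the values of $c$ and $p$ from Table 1.1. The names `SCHEMES` and `introduced_variance` are illustrative only and do not come from the thesis, and the expressions are only expected to be meaningful for large photon numbers.

```python
# Leading-order introduced phase variance 2*h(n) ~ 2*c*n**(-p), using the
# (c, p) values of Table 1.1. All names here are illustrative assumptions.
SCHEMES = {
    "heterodyne": (1 / 8, 1.0),
    "mark I": (1 / 8, 0.5),
    "mark II": (1 / 16, 1.5),
}


def introduced_variance(scheme, n):
    """Approximate introduced phase variance for mean photon number n."""
    c, p = SCHEMES[scheme]
    return 2 * c * n ** (-p)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        row = ", ".join(
            f"{name}: {introduced_variance(name, n):.3e}" for name in SCHEMES
        )
        print(f"n = {n}: {row}")
```

For any fixed large $n$ the ordering is mark II below heterodyne below mark I, which is the comparison made in the text.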



The mark II measurement scheme still leaves questions, however. It has been shown [37] that
optimum states have intrinsic phase variances that scale as $1.89 \times \bar{n}^{-2}$. For these states the phase
uncertainty that is introduced by mark II measurements will be far greater than the intrinsic phase
uncertainty of the state. In addition it has been shown [34] that the theoretical limit to the variance
that is introduced by dyne phase measurements is $\frac{1}{4}\log\bar{n} \times \bar{n}^{-2}$. This is not quite as good as the
scaling for the intrinsic phase variance, but it is far better than the scaling for mark II measurements.

Another issue is that the intermediate phase estimates used for mark II measurements are still
$\arg A_v$. As evidenced by the poor scaling for mark I measurements, these phase estimates are far
worse than $\arg C_v$. This raises the question of whether phase measurements can be made closer to
the theoretical limit by using better intermediate phase estimates. This is considered in Ch. 3, and
it is shown that measurements can be made very close to the theoretical limit.

Figure 1.3: The Mach-Zehnder interferometer, with the addition of a controllable phase $\Phi$ in one
arm. The unknown phase to be estimated is $\varphi$. Both beam splitters (BS) are 50/50.

Another factor in attempting to make the most accurate possible phase measurements is the 
input state. Rather than just considering states that are optimised for minimum intrinsic phase 
variance, the fairest way to evaluate the various measurement schemes is to consider states that are 
optimised for minimum phase variance under that particular phase measurement scheme. This is 
considered in Ch. 2.

It is necessary to have a constraint on the state being optimised in order to avoid obtaining a state 
with indefinitely large photon number. The two main ways of constraining the state that is being 
optimised are to put an upper limit on the photon number and to specify the mean photon number. 
A third alternative is to only consider squeezed states. These states are far more convenient to work 
with, both theoretically and experimentally, and in addition they give results that are extremely 
close to those for the general optimisation problem for general measurements. All these alternatives
are considered in Ch. 2.



1.6 Interferometry 

The major alternative to dyne measurements is interferometric phase measurements. Here, rather 
than measuring the phase of a single mode, and assuming that the local oscillator field is sufficiently 
intense that it can be treated classically, we are measuring the phase difference between two modes, 
both of which are treated quantum mechanically. 

The most convenient way of considering this is via a Mach-Zehnder interferometer, as in Fig. 1.3.
Two input modes are combined at a beam splitter, after which each of the modes is subjected to a
phase shift, and the two modes are recombined at a second beam splitter. Usually we would consider
measurement of the phase difference between the two arms. Here for simplicity I consider the phase
shift to be measured, $\varphi$, to be in one arm, and add a controllable phase shift, $\Phi$, in the second arm.
This allows us to make adaptive measurements analogous to the case for dyne measurements.

Note that if the first beam splitter is omitted, the configuration is identical to the configuration 
used to make dyne measurements of a single mode. The arm of the interferometer subjected to the
phase shift $\Phi$ is equivalent to the local oscillator mode, and the arm subjected to the phase shift $\varphi$
is equivalent to the signal mode in the dyne case. The difference here is that we are now treating
both modes quantum mechanically. 

The main role of the first beam splitter is that it creates quantum correlations between the two 
modes, which generally improves the phase properties of the state. For example, it is not possible 
to measure a phase difference between a number state and a vacuum state, but the two mode state 
produced after the beam splitter has good phase properties. 






In general it is much easier to produce input states without quantum correlations between the 
modes, so the initial beam splitter is very useful. In this study, I consider two-mode input states with 
arbitrary correlations between the modes. In this case, the initial beam splitter is fairly superfluous, 
as it merely transforms the input state into another correlated state that could have been considered 
directly. I still include the initial beam splitter in this study, for consistency with previous work. 

It is well known that if we feed a state with a mean photon number of $\bar{n}$ into one arm, it is
possible to obtain a phase variance scaling as $\bar{n}^{-1}$. This is analogous to the case of coherent states
for dyne measurements, which are the easiest to produce and also give $\bar{n}^{-1}$ scaling. There have been
several proposals for reducing the phase variance to $\bar{n}^{-2}$. The first of these is that by Caves 0.

Caves considers an interferometer with a coherent state in one arm, but in the other arm, rather
than a vacuum state, a squeezed vacuum is used. The measurements considered are equivalent to
homodyne measurements, where the phase difference is very close to 0. For the right squeezing
parameter these measurements have a variance scaling as $\bar{n}^{-2}$. Squeezed states were also considered
by Bondurant and Shapiro [38].

Yurke et al. consider an input state that is a combination of input number states,

$$|\psi\rangle = \frac{1}{\sqrt{2}} \left( |j\rangle_a |j\rangle_b + |j+1\rangle_a |j-1\rangle_b \right), \tag{1.85}$$

where the subscripts $a$ and $b$ indicate number states in ports $a$ and $b$ respectively. According to their
analysis this state should give a phase uncertainty scaling close to $N^{-1}$, where $N = 2j$ is the fixed
total photon number. Similarly to Ref. Q| the phase must be very close to 0 to obtain this scaling.
Yurke et al. also consider active interferometers, characterised by SU(1,1) rather than SU(2), as is
the conventional Mach-Zehnder interferometer considered in this study.

Holland and Burnett [40] also considered the case where the total photon number is fixed. They
considered the state with equal photon numbers in both input ports, $|j\rangle_a |j\rangle_b$. For this state the
phase uncertainty scales as $N^{-1}$, but again only for phases very close to 0. This state also has the
additional problem that it gives results separated by $\pi$ with equal probability, and the phase
therefore must be considered modulo $\pi$.

Sanders and Milburn [41, 42] considered "optimal" measurements, for which the phase uncertainty
is independent of the system phase. Unfortunately these measurements are derived from
projections onto the phase states, rather than a physical measurement scheme involving photodetectors.
In Ch. |^ it is shown that it is not possible to implement these measurements using photodetectors, 
even allowing feedback. Therefore I will generally call these measurements ideal or canonical rather 
than optimal, and reserve the term optimal for the best possible physically realisable scheme. 

Sanders and Milburn considered the same input state as Holland and Burnett, $|j\rangle_a |j\rangle_b$. In
Ref. |Q they show that the phase uncertainty for this state scales as $N^{-1}$ according to two common
measures of the uncertainty. In Ch. 5 I show that this state has very poor scaling if we consider the
phase variance. The optimal input states as evaluated using the Holevo phase variance are derived,
and it is shown that all the common measures of the phase uncertainty scale as $N^{-1}$ for these states.
Both the $|j\rangle_a |j\rangle_b$ and $\frac{1}{\sqrt{2}}\left(|j\rangle_a |j\rangle_b + |j+1\rangle_a |j-1\rangle_b\right)$ states are rough approximations of this state.



1.7 Experimental Imperfections 

The majority of the work considered in this thesis ignores all experimental imperfections. This is 
because the main motivation for this work is to lay a theoretical foundation for how far phase mea- 
surements can, in principle, be improved, rather than to examine the limits to phase measurements 
using current technology.

Firstly there are a number of problems that are very specific to the apparatus. For example an 
inaccurate calibration in the phase shifter producing the phase $\Phi$ will result in a corresponding error
in the measured phase. These types of problems are relatively simple to analyse, and as they are 
very specific to the equipment used they will not be discussed here. 

One problem that is common to any phase measurement is inefficient photodetectors. No current 
photodetectors will register every photon. There are two main types of photodetectors with different
efficiencies. Firstly there are large amplitude photodetectors that do not need to distinguish the exact
photon number. These are used for dyne measurements, where the photon counts are treated in the
continuous approximation. These can be made very efficient, and the best current photodetectors
have efficiencies around 98% [43].

The case of dyne measurements with inefficient detectors was considered in [35]. For the case
of coherent states, the analysis is simple, as an efficiency $\eta$ can be treated by changing the mean
photon number from $\bar{n}$ to $\eta\bar{n}$. This means that, for mark II measurements, the phase variance to
first order is

$$\frac{1}{4\alpha^2} + \frac{1-\eta}{4\eta\alpha^2} + O(\alpha^{-3}). \tag{1.86}$$

We will get the same result (to first order) for any other reasonably good phase feedback scheme
where the introduced phase variance is of higher order than the intrinsic phase variance. The phase
variance is in the form of the intrinsic phase variance plus an extra term due to the inefficient
photodetectors. As explained previously, the introduced phase variance is generally independent of
the input state (to first order). This means that

$$\Delta V(\phi) = \frac{1-\eta}{4\eta\bar{n}} \tag{1.87}$$

will be the introduced phase variance due to the inefficient photodetectors for other states with
reduced phase uncertainty.
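To give a feeling for the size of this term, the sketch below evaluates the extra variance $(1-\eta)/(4\eta\bar{n})$ and checks that, for a coherent state, adding it to the intrinsic variance $1/(4\bar{n})$ is the same as replacing $\bar{n}$ by $\eta\bar{n}$. The function names are illustrative assumptions, and only the first-order terms discussed above are included.

```python
def inefficiency_variance(eta, nbar):
    """Extra phase variance (1 - eta) / (4 * eta * nbar) for detector efficiency eta."""
    if not 0.0 < eta <= 1.0:
        raise ValueError("efficiency must lie in the interval (0, 1]")
    return (1.0 - eta) / (4.0 * eta * nbar)


def coherent_variance_first_order(eta, nbar):
    """Intrinsic coherent-state variance 1/(4 nbar) plus the inefficiency term."""
    return 1.0 / (4.0 * nbar) + inefficiency_variance(eta, nbar)


if __name__ == "__main__":
    nbar = 100.0
    for eta in (1.0, 0.98, 0.9):
        extra = inefficiency_variance(eta, nbar)
        total = coherent_variance_first_order(eta, nbar)
        print(f"eta = {eta}: extra = {extra:.3e}, total = {total:.3e}")
```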

The second type of photodetectors is photon counters, that count photons one by one. This is 
the type required for interferometric measurements, where we wish to alter the feedback phase after 
every detection. These generally have far lower efficiencies, less than 90% [44]. This is a serious
problem, as the analysis breaks down if even a single photon is missed. Taking account of a fixed 
probability of missing a photon would greatly complicate the analysis, and was not attempted in 
this study. 

Another problem, that is unique to phase measurements involving feedback, is the time delay in 
the feedback loop. This is arguably an even more fundamental problem than inefficient photodetec- 
tors, because the feedback loop cannot operate faster than the time it takes light to reach the phase 
modulator. This is a very serious problem for short pulses. For example, if the pulse is shorter than 
the distance between the phase modulator and the photodetector it is not possible to perform any 
sort of adaptive measurement. Adaptive measurements rely on the pulse being sufficiently long that 
the feedback phase can be altered during the passage of the signal. 

Even if the time delay is a small fraction of the length of the signal, there will be an appreciable 
increase in the phase variance. This was investigated by a highly simplified theory in Ref. [35], and
in Ch. 4 the introduced phase variance is estimated in a far more rigorous way.



1.8 Structure of the Thesis 

This thesis is structured around the problems of determining the optimum input states and measurement
schemes for dyne measurements and interferometry. The optimum input states for dyne
measurements are derived in Ch. 2, and the problem of performing optimum dyne measurements is
discussed in Ch. 3. Optimum input states for interferometry are discussed in Ch. 5, and optimum
interferometric measurements are discussed in Ch. 6. These chapters form the main theme of this
thesis. Some additional problems considered are time delays in Ch. 4 and continuous (rather than
pulsed) measurements in Ch. 7.

As mentioned above there are three main alternatives when considering optimum input states 
for dyne measurements: 

1. An upper limit is placed on the photon number. 

2. The mean photon number is fixed, but the state is otherwise arbitrary.

3. The state is a squeezed state, and the mean photon number is fixed.

These alternatives are considered in Secs 2.1, 2.2 and 2.3 respectively of Ch. 2. In each case both
canonical measurements and more general dyne measurements where $h(n) \approx cn^{-p}$ are considered.

The analytic results for canonical measurements and for general dyne measurements with an 
upper limit on the photon number were previously derived in Refs [37], [35], [45]. These derivations
are summarised here, as the method involved sheds light on the method of solution for the more 
complicated cases of general dyne measurements for fixed mean photon number and for squeezed 
states. The accuracy of all these results is evaluated by extensive numerical calculations. 

The problem of performing dyne phase measurements at the theoretical limit is too complicated
to solve analytically, and Ch. 3 therefore relies upon numerical methods. The various feedback
schemes are evaluated numerically by solving the stochastic differential equations (SDEs) for a large
number of samples. This is made simpler by using squeezed states, for which only the two squeezing
parameters need be kept track of. The problem of deriving the SDEs for the squeezing parameters,
as well as much of the background theory of the evolution of the state, is described in Sec. 3.3.

A series of different phase feedback schemes that give results progressively closer to the theo- 
retical limit are described in Sees B.l: to 3.7. The last of these is a corrected feedback scheme that 
gives variances within about 5% of the theoretical limit. Lastly in this chapter, the possibilities of 
surpassing the theoretical limit are considered. 

The perturbation approach for time delays that is considered in Ref. [35] is repeated in a more
rigorous way in Ch. 4. It is shown that the same result is obtained for the final value of the
intermediate phase estimate, but this approach fails when it is performed for mark II measurements.
In Sec. [D^ an alternative approach based on the POM is considered that gives a result that is valid
not only for the mark II measurements, but for all other feedback schemes. These results are all
backed by extensive numerical calculations in Sec. 4.4.



In Ch. 5 the optimum input states for interferometry are derived, as evaluated using the Holevo
variance for ideal measurements. The phase variance for these states scales as $N^{-2}$, so the uncertainty
scales as $N^{-1}$. The Holevo variance for three alternative input states is calculated, and it is
shown that all of these have poorer scaling. The optimum input states are then evaluated under
several alternative measures of phase uncertainty, and are shown to have $N^{-1}$ scaling for all of these.

In Ch. 6 a feedback scheme is introduced that approximates the ideal measurements very closely.
For optimal input states, the scaling in the phase variance under this measurement scheme is very
close to $N^{-2}$. For photon numbers above 5, this measurement scheme is not exactly optimal, but it is
possible to solve numerically for the optimal feedback scheme (for small photon numbers). Although 
it is not always possible to obtain variances as small as canonical, using this feedback scheme it is 
possible to obtain variances smaller than canonical for some states. In Ch. || the resolution of this 
apparent contradiction is discussed. 

The last area that is considered in this thesis is that of continuous measurements. If we wish to 
transmit information via the phase, a pulsed signal with a single phase is not very useful, as only 
a single real number is transmitted. In order to transmit a significant amount of information we 
can either send a whole series of pulses with different phases, or produce a constant beam with a 
fluctuating phase. This is the alternative that is considered in Ch. 7.

In this chapter both the cases of dyne measurements and interferometry are considered. Unfor- 
tunately the non-classical states with reduced phase uncertainty do not necessarily have equivalents 
in the continuous case. This is because the reduced phase uncertainty is due to the back-action of 
the measurement on the state. If the state does not change, then it is not possible to get the reduced 
variance. 

In the case of dyne measurements it is possible to consider squeezed states by altering the
statistics for the detections in time $dt$, but keeping $\alpha$ constant. Therefore both the coherent and
squeezed state cases are considered in Ch. 7. It does not seem to be possible to use this method for
interferometry, as we cannot alter the statistics for individual detections. Therefore only the case 
with all photons in one port is considered in this chapter. 



Chapter 2 



Optimal Input States for Dyne Measurements

There are, in general, two areas for improvement in the use of dyne measurements for phase esti- 
mation. Firstly there is the input state, and secondly there is the measurement technique that is 
used. In this chapter I will be focusing on optimising the input states for minimum phase variance 
under various types of dyne measurements, and in the next chapter I discuss how to perform the 
best possible dyne measurements. 

In optimising the input states, there must be some constraint placed on the state, because we 
can reduce the phase variance indefinitely by using larger and larger photon numbers. The limit of 
this is phase eigenstates, which have zero intrinsic phase variance, but infinite mean photon number. 
There are two main ways in which we can constrain the input states in order to avoid this problem. 
One way is to place an upper limit on the photon number that the state can have contributions 
from, and another way is to fix the mean photon number, but allow the state to have contributions 
from indefinitely large photon numbers. 

A third alternative is to consider squeezed states rather than arbitrary states. For these states 
we can optimise the squeezing parameter while keeping the mean photon number fixed. The reason 
for considering this case is that it is rather more realistic than considering arbitrary states, as it 
is not possible to produce an arbitrary state experimentally, whereas it is possible to produce a 
squeezed state. In addition squeezed states are easily treated numerically, as only the two squeezing 
parameters need be considered, rather than the full state. A third reason is that squeezed states 
give results extremely close to general optimised states for realistic dyne measurements. 

I will firstly consider the case where there is an upper limit placed on the photon number. 

2.1 Upper Limit on Photon Number 

This case is far simpler than the case of a fixed mean photon number, and the analytic results were 
derived before this study commenced. I will summarise the derivations here, however, as they are 
necessary to understand the numerical results that will be presented. In addition, these derivations 
cast light on how to derive the more complicated results with fixed mean photon number. 

2.1.1 Canonical Measurements 

Firstly I will consider the optimum states for canonical measurements. This is the simplest case, 
and in fact the only case which is exactly soluble. It is solved, for example, in Refs [37] and [35]. As
discussed in the introduction, the measure of the phase variance that will be used throughout this
thesis is the Holevo phase variance,

$$V(\phi) = \left|\langle e^{i\phi} \rangle\right|^{-2} - 1. \tag{2.1}$$






For simplicity, the mean phase of the optimised state can be taken to be zero. This means that
$\langle e^{i\phi} \rangle$ is real. Therefore, rather than using Eq. (2.1), we can use

$$V(\phi) = \left|\tfrac{1}{2}\left(\langle e^{i\phi} \rangle + \langle e^{-i\phi} \rangle\right)\right|^{-2} - 1 = \langle \cos\phi \rangle^{-2} - 1. \tag{2.2}$$

This means that minimising the Holevo phase variance is equivalent to maximising $\langle \cos\phi \rangle$. Thus
minimising the Holevo phase variance is equivalent to minimising the measure of the variance

$$V'(\phi) = 2\left(1 - \langle \cos\phi \rangle\right). \tag{2.3}$$

In addition, this measure of the phase variance has the same value in the limit of small variance as 
the Holevo phase variance. This measure is not exactly equal to the Holevo phase variance, and as 
is shown later it differs in the higher order terms. This is significant for general dyne measurements 
where we wish to obtain higher order terms in the expressions for the phase variance. 
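The difference between the two measures can be made concrete with a short numerical sketch. Writing $\langle\cos\phi\rangle = 1 - \epsilon$, the Holevo variance expands as $2\epsilon + 3\epsilon^2 + \ldots$, whereas $V'$ is exactly $2\epsilon$, so the two agree at first order and differ by about $3\epsilon^2$. The function names below are illustrative, not taken from the thesis.

```python
def holevo_variance(mean_cos):
    """Holevo phase variance <cos(phi)>**-2 - 1 for a state with zero mean phase."""
    return mean_cos ** -2 - 1


def linear_measure(mean_cos):
    """The alternative measure V' = 2 * (1 - <cos(phi)>)."""
    return 2 * (1 - mean_cos)


if __name__ == "__main__":
    for eps in (1e-1, 1e-2, 1e-3):
        v = holevo_variance(1 - eps)
        v_prime = linear_measure(1 - eps)
        print(f"eps = {eps}: V = {v:.6e}, V' = {v_prime:.6e}, "
              f"difference = {v - v_prime:.3e}, 3*eps^2 = {3 * eps ** 2:.3e}")
```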
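Now recall from Eq. (1.52) that

$$\langle e^{i\phi} \rangle = \sum_{n=0}^{\infty} \langle\psi|n\rangle \langle n+1|\psi\rangle H_{n,n+1}. \tag{2.4}$$

This means that we can represent $e^{i\phi}$ as the operator

$$\widehat{\exp(i\phi)} = \sum_{n=0}^{\infty} |n\rangle\langle n+1|\, H_{n,n+1}, \tag{2.5}$$

and we can represent $\cos\phi$ by the operator

$$2\widehat{\cos\phi} = \sum_{n=0}^{\infty} \left[\, |n\rangle\langle n+1| + |n+1\rangle\langle n| \,\right] H_{n,n+1}. \tag{2.6}$$

When we put an upper limit of $N$ on the photon number, we can replace this with

$$2\widehat{\cos\phi} = \sum_{n=0}^{N-1} \left[\, |n\rangle\langle n+1| + |n+1\rangle\langle n| \,\right] H_{n,n+1}. \tag{2.7}$$

For canonical measurements $H_{n,n+1} = 1$, so this becomes

$$2\widehat{\cos\phi} = \sum_{n=0}^{N-1} \left[\, |n\rangle\langle n+1| + |n+1\rangle\langle n| \,\right]. \tag{2.8}$$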

In general, when we wish to maximise (or minimise) the expectation value of some Hermitian
operator $A$ while keeping the expectation values of other Hermitian operators $B, C, \ldots$ constant, we
use the method of undetermined multipliers. For this, we require that the matrix elements of all of
these operators, $\langle n|X|m\rangle$, be real, so the corresponding matrices are symmetric.

The state will be expressed as $|\psi\rangle = \sum_n \psi_n |n\rangle$, where the coefficients $\psi_n$ are real. It is not
possible to obtain any smaller phase variance using complex $\psi_n$. To see this, note that Eq. (2.7)
gives

$$2\langle \widehat{\cos\phi} \rangle = \sum_{n=0}^{N-1} \left[ \psi_n^* \psi_{n+1} + \psi_{n+1}^* \psi_n \right] H_{n,n+1} = 2\,{\rm Re} \sum_{n=0}^{N-1} \psi_n^* \psi_{n+1} H_{n,n+1}. \tag{2.9}$$

From this it is clear that the maximal value of $\langle \cos\phi \rangle$ is obtained for real $\psi_n^* \psi_{n+1}$. This in turn
implies that the minimum phase variance will be when $\arg(\psi_n)$ is independent of $n$. Without loss
of generality $\arg(\psi_n)$ can be taken to be zero.

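Therefore, taking the $\psi_n$ and increments $d\psi_n$ to be real, we obtain

$$\begin{aligned} d\langle\psi|X|\psi\rangle &= (d\langle\psi|) X |\psi\rangle + \langle\psi| X (d|\psi\rangle) \\ &= \sum_n d\psi_n \left( \langle n|X|\psi\rangle + \langle\psi|X|n\rangle \right) \\ &= 2 (d\langle\psi|) X |\psi\rangle. \end{aligned} \tag{2.10}$$

If we have a maximum of $\langle A \rangle$ while $\langle B \rangle, \langle C \rangle, \ldots$ are kept constant, then for any increment $d|\psi\rangle$
which does not change the expectation values of $B, C, \ldots$, the increment in $\langle A \rangle$ must be zero also.
This means that if

$$(d\langle\psi|) X |\psi\rangle = 0, \tag{2.11}$$

for $X = B, C, \ldots$, then $(d\langle\psi|) A |\psi\rangle$ must be equal to zero. This means that $A|\psi\rangle$ cannot be linearly
independent of $B|\psi\rangle, C|\psi\rangle, \ldots$, so it must be possible to write

$$(\alpha A + \beta B + \gamma C + \ldots) |\psi\rangle = 0, \tag{2.12}$$

for some combination of constants $\alpha, \beta, \gamma, \ldots$.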

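We wish to maximise the expectation value of the above operator with the single constraint that
the state is normalised. Using the method of undetermined multipliers gives

$$\left[ \alpha \left( 2\widehat{\cos\phi} \right) + \beta \hat{1} \right] |\psi\rangle = 0. \tag{2.13}$$

Rearranging this gives the eigenvalue equation

$$\left( 2\widehat{\cos\phi} \right) |\psi\rangle = \nu |\psi\rangle. \tag{2.14}$$

When we expand the state in terms of the photon number states, $|\psi\rangle = \sum_{n=0}^{N} \psi_n |n\rangle$, we find

$$\sum_{n=0}^{N-1} \left( \psi_{n+1} |n\rangle + \psi_n |n+1\rangle \right) = \nu \sum_{n=0}^{N} \psi_n |n\rangle. \tag{2.15}$$

Rearranging this gives

$$\sum_{n=0}^{N-1} \psi_{n+1} |n\rangle + \sum_{n=1}^{N} \psi_{n-1} |n\rangle = \sum_{n=0}^{N} \nu \psi_n |n\rangle. \tag{2.16}$$

This equation is equivalent to the recurrence relation

$$\psi_{n+1} = \nu \psi_n - \psi_{n-1}, \tag{2.17}$$

for $0 < n < N$, with the boundary conditions

$$\psi_1 = \nu \psi_0, \qquad \psi_{N-1} = \nu \psi_N. \tag{2.18}$$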



The recurrence relation (2.17) is satisfied by exponentials of the form

$$\psi_n = e^{\pm i n\,{\rm acos}(\nu/2)}. \tag{2.19}$$

To obtain real solutions, we can use

$$\psi_n = {\rm Re}\left( A e^{i n\,{\rm acos}(\nu/2)} \right). \tag{2.20}$$

The boundary conditions can be more conveniently expressed by extending the range of the recurrence
relation to $0 \le n \le N$, and taking

$$\psi_{-1} = 0, \qquad \psi_{N+1} = 0. \tag{2.21}$$

The first boundary condition then gives the phase of $A$ as

$$\arg A = {\rm acos}\frac{\nu}{2} - \frac{\pi}{2}. \tag{2.22}$$

This boundary condition does not give the magnitude, which must be found by normalisation. The
last boundary condition gives

$$(N+2)\,{\rm acos}\frac{\nu}{2} = k\pi, \tag{2.23}$$

where $k$ is an arbitrary integer. The eigenvalues are therefore

$$\nu_k = 2\cos\left( \frac{k\pi}{N+2} \right), \tag{2.24}$$

and the coefficients for the corresponding (unnormalised) eigenvectors are

$$\psi_n = {\rm Re}\left[ -i e^{i(n+1)k\pi/(N+2)} \right] = \sin\left[ \frac{(n+1)k\pi}{N+2} \right]. \tag{2.25}$$

It is clear from Eq. (2.14) that

$$\langle 2\widehat{\cos\phi} \rangle = \nu, \tag{2.26}$$

so

$$V(\phi) = (\nu/2)^{-2} - 1. \tag{2.27}$$

Therefore the eigenvalue that minimises the Holevo phase variance is the maximum eigenvalue,

$$\nu_1 = 2\cos\left( \frac{\pi}{N+2} \right). \tag{2.28}$$

For this eigenvalue the exact Holevo phase variance is

$$V(\phi) = \tan^2\left( \frac{\pi}{N+2} \right). \tag{2.29}$$

The state is given by

$$|\psi\rangle = \frac{1}{\sqrt{N/2+1}} \sum_{n=0}^{N} \sin\left[ \frac{(n+1)\pi}{N+2} \right] |n\rangle. \tag{2.30}$$

Here the factor of $1/\sqrt{N/2+1}$ is required for the state to be normalised.
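The canonical result can be checked numerically without any eigenvalue solver. The sketch below builds the coefficients $\psi_n \propto \sin[(n+1)\pi/(N+2)]$ with the normalisation $1/\sqrt{N/2+1}$, and verifies that they are normalised, that they satisfy the recurrence $\psi_{n+1} + \psi_{n-1} = \nu\psi_n$ with $\psi_{-1} = \psi_{N+1} = 0$, and that the resulting Holevo variance equals $\tan^2[\pi/(N+2)]$. The function names are illustrative assumptions rather than anything used in the thesis.

```python
import math


def optimal_canonical_state(N):
    """Normalised coefficients psi_n, n = 0..N, for maximum photon number N."""
    norm = math.sqrt(N / 2 + 1)
    return [math.sin((n + 1) * math.pi / (N + 2)) / norm for n in range(N + 1)]


def mean_two_cos(psi):
    """<2 cos(phi)> for real coefficients and canonical measurements (H = 1)."""
    return 2 * sum(a * b for a, b in zip(psi, psi[1:]))


def recurrence_residual(psi, nu):
    """Largest violation of psi_{n+1} + psi_{n-1} = nu * psi_n on 0 <= n <= N."""
    padded = [0.0] + list(psi) + [0.0]  # psi_{-1} = psi_{N+1} = 0
    return max(
        abs(padded[i + 1] + padded[i - 1] - nu * padded[i])
        for i in range(1, len(padded) - 1)
    )


if __name__ == "__main__":
    N = 10
    psi = optimal_canonical_state(N)
    nu = mean_two_cos(psi)
    print("norm squared   :", sum(x * x for x in psi))
    print("eigenvalue     :", nu, "expected", 2 * math.cos(math.pi / (N + 2)))
    print("Holevo variance:", (nu / 2) ** -2 - 1,
          "expected", math.tan(math.pi / (N + 2)) ** 2)
    print("residual       :", recurrence_residual(psi, nu))
```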
Note that for large photon number we have

$$V(\phi) = \tan^2\left( \frac{\pi}{N+2} \right) \approx \left( \frac{\pi}{N+2} \right)^2 \approx \frac{\pi^2}{N^2}. \tag{2.31}$$

This means that the minimum intrinsic phase variance scales down as rapidly as $N^{-2}$.

2.1.2 General Dyne Measurements

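The method of solution for canonical measurements can be generalised to the case of more general
dyne measurements, where instead of $H_{n,n+1} = 1$, we have

$$H_{n,n+1} = 1 - h(n) \approx 1 - cn^{-p}. \tag{2.32}$$

This case was also considered in Ref. [35]. Again the eigenvalue equation is

$$\left( 2\widehat{\cos\phi} \right) |\psi\rangle = \nu |\psi\rangle, \tag{2.33}$$

except this time

$$\begin{aligned} 2\widehat{\cos\phi} &= \sum_{n=0}^{N-1} \left[\, |n\rangle\langle n+1| + |n+1\rangle\langle n| \,\right] H_{n,n+1} \\ &= \sum_{n=0}^{N-1} \left[\, |n\rangle\langle n+1| + |n+1\rangle\langle n| \,\right] (1 - h(n)). \end{aligned} \tag{2.34}$$

Expanding this out gives

$$\begin{aligned} \sum_{n=0}^{N-1} \psi_{n+1} |n\rangle + \sum_{n=1}^{N} \psi_{n-1} |n\rangle - \sum_{n=0}^{N-1} \left[ h(n)\psi_{n+1} |n\rangle + h(n)\psi_n |n+1\rangle \right] &= \sum_{n=0}^{N} \nu\psi_n |n\rangle, \\ \sum_{n=0}^{N-1} \psi_{n+1} |n\rangle + \sum_{n=1}^{N} \psi_{n-1} |n\rangle - \sum_{n=0}^{N-1} h(n)\psi_{n+1} |n\rangle - \sum_{n=1}^{N} h(n-1)\psi_{n-1} |n\rangle &= \sum_{n=0}^{N} \nu\psi_n |n\rangle. \end{aligned} \tag{2.35}$$

This gives the recursion relation for $0 < n < N$,

$$\psi_{n+1} + \psi_{n-1} - h(n)\psi_{n+1} - h(n-1)\psi_{n-1} = \nu\psi_n. \tag{2.36}$$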



In order to solve this, in Ref. [35] the photon number $n$ is treated as a continuous variable, and
the coefficients $\psi_n$ are replaced with the function $\psi(n)$, which is assumed to be twice differentiable.
Then the approximations are made

$$\psi(n+1) + \psi(n-1) \approx 2\psi(n) + \frac{d^2}{dn^2}\psi(n), \tag{2.37}$$
$$h(n)\psi(n+1) + h(n-1)\psi(n-1) \approx 2h(n)\psi(n). \tag{2.38}$$

The second approximation (2.38) is based upon using $h(n) \approx h(n-1)$ and $\psi(n+1) + \psi(n-1) \approx 2\psi(n)$.
The second derivative is not used, because this term is already much smaller than $\psi(n)$.
With these approximations, the eigenvalue equation becomes

$$\left( 2 + \frac{d^2}{dn^2} - 2cn^{-p} \right) \psi(n) = \nu\psi(n). \tag{2.39}$$

In Ref. [35], the term $2cn^{-p}$ is linearised about $n = N$, and the variables are changed to $y = 1 - N^{-1}n$.
The equation is then

$$\left( -\frac{d^2}{dy^2} + by \right) \psi_k(y) = a_k \psi_k(y), \tag{2.40}$$

where

$$a_k = N^2 \left( 2 - \nu_k - 2cN^{-p} \right), \qquad b = 2cpN^{2-p}. \tag{2.41}$$

The boundary condition is $\psi(0) = 0$, and the boundary condition at $y = 1$ is ignored.

Note that, similarly to the case for canonical measurements, the exact boundary conditions
should be $\psi_{-1} = \psi_{N+1} = 0$. In terms of $y$, this means we should have $\psi(y) = 0$ for $y = -1/N$ and
$1 + 1/N$. The $1/N$ terms in these boundary conditions are ignored, as they merely give higher order
corrections to the results.

Using these approximations, the solutions are

$$\psi_k(y) \propto {\rm Ai}\left( z_k + b^{1/3} y \right), \tag{2.42}$$

where Ai is the Airy function and $z_k$ is the $k$th real zero of the Airy function satisfying $0 > z_1 >
z_2 > \ldots$. The eigenvalues are

$$\nu_k = 2 - \left( 2cN^{-p} + |z_k| (2cp)^{2/3} N^{-2(1+p)/3} \right), \tag{2.43}$$


and so it is obvious that the maximum eigenvalue is for $k = 1$. The corresponding Holevo phase
variance is

$$\begin{aligned} V(\phi) &= (\nu_1/2)^{-2} - 1 \\ &= \left[ 1 - \left( cN^{-p} + \tfrac{1}{2} |z_1| (2cp)^{2/3} N^{-2(1+p)/3} \right) \right]^{-2} - 1 \\ &\approx 1 + \left( 2cN^{-p} + |z_1| (2cp)^{2/3} N^{-2(1+p)/3} \right) + 3\left( cN^{-p} \right)^2 - 1 \\ &= 2cN^{-p} + |z_1| (2cp)^{2/3} N^{-2(1+p)/3} + 3c^2 N^{-2p}. \end{aligned} \tag{2.44}$$

This method should be used rather than using the approximation $V(\phi) \approx 2 - \nu_k$ (as is used in [35]),
as the third term here will be of the same order as the second for mark I measurements.

The value of the first zero of the Airy function is $z_1 \approx -2.33810741045976$. Using these results,
and the values of $c$ and $p$ for heterodyne, mark I and mark II measurements given in Table 1.1, the
variances for these measurements should be given by

$$V(\phi_{\rm het}) \approx \tfrac{1}{4} N^{-1} + 0.927879 \times N^{-4/3}, \tag{2.45}$$
$$V(\phi_{\rm I}) \approx \tfrac{1}{4} N^{-1/2} + 0.631402 \times N^{-1}, \tag{2.46}$$
$$V(\phi_{\rm II}) \approx \tfrac{1}{8} N^{-3/2} + 0.765947 \times N^{-5/3}. \tag{2.47}$$

The second term for mark I measurements here includes the third term from Eq. (2.44). This term
has been omitted in the other two cases, as it is of higher order. As is discussed in the next section,
$h_{\rm I}(n)$ has an extra term of order $N^{-1}$. This term would also need to be taken into account in order
for the second term for mark I measurements to be accurate.
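The numerical coefficients in Eqs. (2.45) to (2.47) can be reproduced directly from Eq. (2.44). The sketch below does this using the value of $z_1$ quoted above and the values of $c$ and $p$ from Table 1.1; the names `SCHEMES` and `variance_expansion` are illustrative assumptions. The third term of Eq. (2.44) is added to the second coefficient only for mark I, where $p = 1/2$ makes $N^{-2p}$ and $N^{-2(1+p)/3}$ the same power of $N$.

```python
# First zero of the Airy function, copied from the text above.
Z1 = -2.33810741045976

# (c, p, fold_in_third_term) with c and p taken from Table 1.1.
SCHEMES = {
    "heterodyne": (1 / 8, 1.0, False),
    "mark I": (1 / 8, 0.5, True),
    "mark II": (1 / 16, 1.5, False),
}


def variance_expansion(scheme):
    """Return ((a1, e1), (a2, e2)) such that V ~ a1 * N**e1 + a2 * N**e2."""
    c, p, fold_in_third_term = SCHEMES[scheme]
    second = abs(Z1) * (2 * c * p) ** (2 / 3)
    if fold_in_third_term:
        second += 3 * c ** 2  # same power of N as the second term when p = 1/2
    return (2 * c, -p), (second, -2 * (1 + p) / 3)


if __name__ == "__main__":
    for name in SCHEMES:
        (a1, e1), (a2, e2) = variance_expansion(name)
        print(f"{name}: {a1:.6f} * N^({e1:.4f}) + {a2:.6f} * N^({e2:.4f})")
```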



2.1.3 Numerical Results 

These analytic results have been verified numerically by calculating the optimised states for het- 
erodyne measurements and adaptive mark I and II measurements. For moderate maximum photon 
numbers the calculations were exact. For heterodyne measurements the vector /ihet( n ) is given 
exactly by Q 

, \ r(™ + 3/2) 

h hct (n) = 1 - V '/ ==. 2.48 

(n + l)T(n + 2) 

This form of the equation cannot be used for large $n$ due to roundoff error. For $n > 10$ the first twelve terms of an asymptotic expansion were used, giving results as accurate as or more accurate than the exact expression (2.48). These terms were determined from the asymptotic expansion for $\log\Gamma(n)$ [46], and are given below:

\[
\begin{aligned}
h_{\rm het}(n) \approx{}& \frac{1}{2^{3}(n+1)} - \frac{1}{2^{7}(n+1)^{2}} - \frac{5}{2^{10}(n+1)^{3}} + \frac{21}{2^{15}(n+1)^{4}} + \frac{399}{2^{18}(n+1)^{5}} - \frac{869}{2^{22}(n+1)^{6}} \\
& - \frac{39325}{2^{25}(n+1)^{7}} + \frac{334477}{2^{31}(n+1)^{8}} + \frac{28717403}{2^{34}(n+1)^{9}} - \frac{59697183}{2^{38}(n+1)^{10}} \\
& - \frac{8400372435}{2^{41}(n+1)^{11}} + \frac{34429291905}{2^{46}(n+1)^{12}}.
\end{aligned} \tag{2.49}
\]
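As a rough check of Eqs. (2.48) and (2.49), the two forms can be compared numerically. This is a minimal sketch rather than the code used for the thesis: it evaluates the exact expression through `math.lgamma` to avoid overflow, and the signs in `SERIES` are those that follow from the standard asymptotic expansion of $\Gamma(z+1/2)/[\sqrt{z}\,\Gamma(z)]$ with $z = n+1$. The function names are hypothetical.

```python
import math

# (signed numerator, power of two in the denominator) for terms 1..12 in powers of 1/(n+1)
SERIES = [
    (1, 3), (-1, 7), (-5, 10), (21, 15), (399, 18), (-869, 22),
    (-39325, 25), (334477, 31), (28717403, 34), (-59697183, 38),
    (-8400372435, 41), (34429291905, 46),
]

def h_het_exact(n):
    """Eq. (2.48), evaluated with log-gamma functions."""
    log_ratio = math.lgamma(n + 1.5) - 0.5 * (math.lgamma(n + 1) + math.lgamma(n + 2))
    return -math.expm1(log_ratio)  # 1 - exp(log_ratio), without cancellation

def h_het_series(n):
    """Twelve-term asymptotic expansion, Eq. (2.49)."""
    x = 1.0 / (n + 1)
    return sum(num / 2 ** k * x ** (j + 1) for j, (num, k) in enumerate(SERIES))

for n in (10, 100, 1000):
    print(n, h_het_exact(n), h_het_series(n))
```

Both forms approach $1/[8(n+1)]$ for large $n$, consistent with $c = 1/8$ and $p = 1$ for heterodyne measurements.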
The exact expression for mark I measurements is slightly more complicated. From [34] the exact expression for $H^{\rm I}_{mn}$ for mark I measurements is

\[ H^{\rm I}_{mn} = \sum_{p=0}^{\lfloor m/2 \rfloor} \sum_{q=0}^{\lfloor n/2 \rfloor} \gamma_{mp} \gamma_{nq} M^{p,q}, \tag{2.50} \]

where $\lfloor m/2 \rfloor$ is the integer part of $m/2$,

lmp = 2P(m-2p)\pV { ' 

and M p ' q are calculated using the recursion relation 

2(n — m) 2 + n + m 

with the boundary values 



M U ' m = _J 2 ; - ; - , (2.52) 



M n,0 = M 0,n _ = / 2 53 s 

(2n + l)(2ra- l)...l (2n+l)!! V ' 

The values of $h_{\rm I}(n) = 1 - H^{\rm I}_{n,n+1}$ were calculated up to $n = 3000$ using this expression. Further values were extrapolated by fitting an asymptotic expansion to the results below 3000. The first term, $1/(8\sqrt{n})$, was assumed, and three further terms were obtained by fitting techniques. The terms found were

, , , 1 0.101561734071163 0.0508542807551548 0.0959147066823866 ,„ 
hi(n) w —— jj- 2 + - 2 . (2.54) 

These coefficients were the exact values used in calculations, but the digits given do not reflect the 
accuracy of the fit. (I have given all digits used so that it is possible to accurately reproduce the 
results.) 

The exact expression for mark II measurements is even more complicated than that for mark I
measurements. From [34] the expression is

|m/2j L«/2J / , v ( n -m)/2 \ 

ff "n-E 77m DP(D*y) • (2.55) 

P=0 9=0 \ ^ ^ I Q 

where $D = BA^*/A$, and the subscript Q indicates that the expectation value is for the "ostensible" or vacuum distribution. This notation differs slightly from that in [34], where the symbol $C$ was used, rather than $D$. In Ref. [34] it is shown that

\[ \langle D^n D^{*m} \rangle_Q = M^{n,m}, \tag{2.56} \]

where $M^{n,m}$ is calculated as above. In order to obtain $h_{\rm II}(n)$ we require $H^{\rm II}_{n,n+1}$. In order to calculate these we can expand Eq. (2.55) to obtain

Ln/2j L(»+1)/2J oo oo , , ,s , , n 

<n +1 =E E 7np7n + i.,EE ( / fc T Mfc+p,,+g ' (2 - 57) 

p=0 9=0 fe=0 (=0 

where

\[ (a)_n = a(a+1)\ldots(a+n-1). \tag{2.58} \]

The infinite sum in (2.57) can be determined to within double precision (15 digits) by summing about the first 100 terms. The values of $h_{\rm II}(n)$ were calculated up to $n = 1000$ in this way. I attempted to obtain the higher order terms using this data set in a similar way as for the mark I case, but unfortunately it was not found to be possible to consistently obtain any terms beyond $n^{-1.5}$. Therefore all elements beyond $n = 1000$ were determined using $h_{\rm II}(n) \approx \frac{1}{16} n^{-1.5}$.

Using the above methods for calculating $h(n)$, the minimum phase variance was determined by numerically determining the eigenvalues for heterodyne, mark I and mark II measurements, for photon numbers up to $2^{18}$. These calculations were exact except for the above approximations to $h(n)$ for mark I and II measurements.

Figure 2.1: The minimum phase variance for heterodyne measurements on states with an upper limit $N$ on the photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the asymptotic analytic expression as the continuous line.

For larger photon numbers it was not feasible to solve the exact eigenvalue problem, but an approximate solution was obtained by discretising the continuous approximation of the eigenvalue problem. In order to reduce the number of intervals required in the discretised equation, the equation was solved for three different numbers of intervals (512, 1024 and 2048). The result for the continuous case was then estimated by projecting to zero step size, assuming the error is quadratic in the step size. Two approximations that were made in deriving the analytic result were not made here: the linear approximation of $h(n)$ and the omission of the boundary condition at $y = 1$.
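The projection to zero step size can be illustrated on a simpler model problem. The sketch below does not solve the thesis equation: it assumes the Airy equation $-\psi'' + x\psi = E\psi$ with $\psi(0) = 0$, whose lowest eigenvalue is $|z_1|$, uses a second-order finite-difference discretisation, and uses fewer intervals (256 and 512) than quoted above to keep the run time short. The function names and the truncation length are hypothetical choices.

```python
import numpy as np

ABS_Z1 = 2.33810741045976  # exact lowest eigenvalue of the model problem

def lowest_eigenvalue(intervals, length=12.0):
    """Lowest eigenvalue of -psi'' + x psi = E psi on [0, length], psi = 0 at both ends."""
    step = length / intervals
    x = step * np.arange(1, intervals)           # interior grid points
    main = 2.0 / step ** 2 + x                   # diagonal of the discretised operator
    off = -np.ones(intervals - 2) / step ** 2    # off-diagonals from the second difference
    matrix = np.diag(main) + np.diag(off, 1) + np.diag(off, -1)
    return np.linalg.eigvalsh(matrix)[0]

def project_to_zero_step(coarse, fine):
    """Richardson extrapolation for an error quadratic in the step size (step halved)."""
    return (4.0 * fine - coarse) / 3.0

e256 = lowest_eigenvalue(256)
e512 = lowest_eigenvalue(512)
print(e256, e512, project_to_zero_step(e256, e512))
```

With three grid sizes, as in the text, the two independent extrapolations (from the first and second pair) can be compared to check that the error really is quadratic in the step size.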

The results for heterodyne measurements are shown in Fig. 2.1. The results for the exact calculations agree extremely well with the results for the continuous approximation over the region where both values have been calculated. This indicates that the continuous approximation is a very good approximation of the exact eigenvalue problem.



The results also agree well with the asymptotic expression (2.44), with good agreement for photon numbers above 100. In order to better see the difference between the numeric results and the analytic expression, and in particular to see how accurate the second term in the analytic expression is, it is convenient to define the parameter $z$ by

\[ z = \frac{V(\phi) - 2cN^{-p}}{(2cp)^{2/3} N^{-2(1+p)/3}}. \tag{2.59} \]



From Eq. (2.44), provided the third term $3c^2N^{-2p}$ can be ignored, this should converge to $|z_1| \approx 2.338107$. The values of $z$ for heterodyne measurements are plotted in Fig. 2.2. Again there is extremely good agreement between the values calculated exactly and those calculated using the continuous approximation. The value of $z$ does not converge closely to $|z_1|$ until a photon number of around $10^6$. In fact, we do not have 1% agreement until a photon number of $8 \times 10^6$.

Figure 2.2: The value of $z$ for heterodyne measurements on states with an upper limit $N$ on the photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the theoretical asymptotic value of $|z_1| \approx 2.338107$ is shown as the dash-dotted line.



The results for mark I measurements are plotted in Fig. 2.3. The asymptotic expression plotted here is Eq. (2.46), which includes the extra $3c^2N^{-2p}$ term. There is good agreement between the results and this asymptotic expression for photon numbers above about 10. There is also good agreement between the values calculated exactly and those calculated using the continuous approximation.

For mark I measurements the values of $z$ cannot be expected to converge to $|z_1|$ due to the extra $3c^2N^{-2p}$ term. From the asymptotic expression (2.46), the asymptotic value including this term should be 2.525607. As can be seen in Fig. 2.4, however, $z$ does not converge to this value. The reason for this is that the second term of the asymptotic expression for $V(\phi_{\rm I})$ is of the same order as the second term in the asymptotic expression for $h_{\rm I}(n)$ in Eq. (2.54). Taking account of this term, $V(\phi_{\rm I})$ should be

\[ V(\phi_{\rm I}) \approx \tfrac{1}{4} N^{-1/2} + 0.428278 \times N^{-1}, \tag{2.60} \]

and $z$ should converge to approximately 1.713114. This value is also plotted in Fig. 2.4. As can be seen, the results converge quite accurately to this value. The results for the largest photon numbers do not agree accurately; however, this is just due to poor convergence of the numerical technique.



The results for mark II measurements are plotted in Fig. 2.5. In this case there is good agreement between the results calculated using the two different methods, and the results appear to be close to the asymptotic result for photon numbers above about $10^5$. When we look at the values of $z$ (see Fig. 2.6), we see that there is again excellent agreement between the results calculated using the two different methods, but there is good agreement with the asymptotic value only for photon numbers above $10^{10}$. In fact, we require a photon number above $5 \times 10^{14}$ in order to have better than 1% agreement.

Note that in [35] it is claimed that there should be good convergence to the asymptotic values for

\[ N > \left( \frac{10^3}{2cp} \right)^{1/(2-p)}. \tag{2.61} \]

This means that there should be good agreement with the asymptotic values for photon numbers above about 4000, 400 and $3 \times 10^7$ for heterodyne, mark I and mark II measurements respectively. The numerical results obtained here show that, in order to have better than 1% agreement with the asymptotic results, the maximum photon numbers should be above about $8 \times 10^6$, 4000 and $5 \times 10^{14}$ respectively.

Figure 2.3: The minimum phase variance for mark I measurements on states with an upper limit $N$ on the photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the asymptotic analytic expression as the continuous line.

Figure 2.4: The value of $z$ for mark I measurements on states with an upper limit $N$ on the photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the theoretical asymptotic value of 2.525607 is shown as the dash-dotted line. The asymptotic value taking into account the second term for $h_{\rm I}(n)$ is shown as the dotted line.

Figure 2.6: The value of $z$ for mark II measurements on states with an upper limit $N$ on the photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the theoretical asymptotic value of $|z_1| \approx 2.338107$ is shown as the dash-dotted line.

These apparent discrepancies of many orders of magnitude are not so great when considered in terms of the difference in $z$ rather than the photon number. For a photon number close to 4000 for heterodyne measurements, $z$ is about 13% from the asymptotic value, which is not very poor agreement. Similarly for a photon number around 400 for mark I measurements the value of $z$ is 4% from the asymptotic value, and for a photon number around $3 \times 10^7$ for mark II measurements the value of $z$ is 17% from the asymptotic value. Nevertheless, these discrepancies are still larger than would be expected.

To understand the reason for this, note that the criterion used in [35] was that 99.5% of the solution $\psi(y)$ [as given by Eq. (2.42)] be confined within the lower half of the interval [0, 1]. This criterion means that the approximation that we can ignore the boundary condition at $y = 1$ is very good, but it is a fairly weak criterion for the approximation that we can linearise $h(n)$ about $n = N$ (equivalent to $y = 0$). A better criterion for this approximation to be accurate is that 99.5% of the solution be confined within $y < 0.04$. This criterion means that we require $5b^{-1/3} < 0.04$, where $b = 2cpN^{2-p}$, and gives

\[ N > \left( \frac{125^3}{2cp} \right)^{1/(2-p)}. \tag{2.62} \]

With this criterion the minimum photon numbers for heterodyne, mark I and mark II measurements are $8 \times 10^6$, $6 \times 10^4$ and $10^{14}$ respectively. With the exception of the mark I result, these values are much closer to the photon numbers required for 1% agreement with the asymptotic value of $z$.
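The photon numbers implied by the two confinement criteria can be reproduced with a few lines of arithmetic. This is a sketch under stated assumptions: it takes the 99.5% width of the Airy solution to be $5b^{-1/3}$ with $b = 2cpN^{2-p}$, as in the text, and uses assumed $(c, p)$ pairs of $(1/8, 1)$, $(1/8, 1/2)$ and $(1/16, 3/2)$ for heterodyne, mark I and mark II measurements. The function name is hypothetical.

```python
def minimum_photon_number(c, p, y_max):
    """Smallest N for which 5 * b**(-1/3) < y_max, with b = 2 c p N**(2 - p)."""
    b_min = (5.0 / y_max) ** 3
    return (b_min / (2.0 * c * p)) ** (1.0 / (2.0 - p))

SCHEMES = {"heterodyne": (1 / 8, 1.0), "mark I": (1 / 8, 0.5), "mark II": (1 / 16, 1.5)}

for name, (c, p) in SCHEMES.items():
    half_interval = minimum_photon_number(c, p, 0.5)   # solution confined to y < 1/2
    stricter = minimum_photon_number(c, p, 0.04)       # solution confined to y < 0.04
    print(f"{name}: N > {half_interval:.3g} (y < 0.5), N > {stricter:.3g} (y < 0.04)")
```

The first column gives about 4000, 400 and $3 \times 10^7$, and the second about $8 \times 10^6$, $6 \times 10^4$ and $10^{14}$, matching the values quoted above.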



2.2 Fixed Mean Photon Number

The next case that I consider is that where, rather than an upper limit being put on the photon number, the mean photon number is fixed. This case is rather more complicated: the method of undetermined multipliers gives two undetermined constants, so the problem is no longer a simple eigenvalue problem.

2.2.1 Canonical Measurements

The simpler case of canonical measurements with a fixed mean photon number was solved by Summy and Pegg [37]. I will give the derivation here, as it is simple and indicates the method of solution for the case of general dyne measurements. I am giving a different but equivalent version of the derivation for consistency with the other derivations given in this chapter.

We wish to maximise the expectation value of $2\cos\phi$ while keeping the state normalised and the mean photon number constant. Using the method of undetermined multipliers gives the equation

\[ 2\cos\phi\,|\psi\rangle - \nu|\psi\rangle - \mu N|\psi\rangle = 0. \tag{2.63} \]

Rearranging this gives

\[ \left[ 2\cos\phi - \mu N \right] |\psi\rangle = \nu|\psi\rangle. \tag{2.64} \]

This has two unknown constants, and therefore cannot be solved as a simple eigenvalue problem. Instead we solve it as an eigenvalue equation for $\nu$ with a fixed value of $\mu$, and the eigenstate corresponding to the maximum eigenvalue is an optimised state. The mean photon number can then be found from this state. We can obtain a range of mean photon numbers by adjusting $\mu$.

Expanding the equation out in terms of the number coefficients of $|\psi\rangle$ gives

\[ \sum_{n=0}^{\infty} \psi_{n+1}|n\rangle + \sum_{n=1}^{\infty} \psi_{n-1}|n\rangle - \sum_{n=0}^{\infty} \mu\psi_n n|n\rangle = \sum_{n=0}^{\infty} \nu\psi_n|n\rangle, \tag{2.65} \]

which implies the recurrence relation

\[ \psi_{n+1} + \psi_{n-1} - \mu\psi_n n = \nu\psi_n. \tag{2.66} \]

Similarly to the derivation for general dyne measurements when there is an upper limit on the photon number, we can use the continuous approximation where we replace $\psi_n$ with $\psi(n)$, and use

\[ \psi(n+1) + \psi(n-1) \approx \left( 2 + \frac{d^2}{dn^2} \right) \psi(n). \tag{2.67} \]

With this approximation the recurrence relation becomes the differential equation

\[ \left( -\frac{d^2}{dn^2} + \mu n \right) \psi(n) = (2 - \nu)\psi(n). \tag{2.68} \]

This equation is exactly equivalent to the differential equation obtained in the maximum photon number case (2.40), and it therefore has the solutions

\[ \psi_k(n) \propto \mathrm{Ai}(z_k + \mu^{1/3} n), \tag{2.69} \]

with the eigenvalues

\[ \nu_k = 2 - |z_k| \mu^{2/3}. \tag{2.70} \]

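It is clear that the solution that maximises $\nu$ is again that with $k = 1$. In Ref. [37] the authors say that the solution must have a zero for $n = -\epsilon$, where $0 < \epsilon < 1$. The boundary condition for the discrete equation is $\psi_{-1} = 0$, which implies that the solution should have a zero for $n = -1$. I will take $\psi(0) = 0$ here, as the difference only gives higher order terms to the solution.

It can be shown numerically that

\[ \langle z_1 + \mu^{1/3} n \rangle \approx -0.779369136819922. \tag{2.71} \]

For brevity I will call this constant $\langle X \rangle$ (for consistency with [37]). Rearranging this gives

\[ \mu^{1/3} = \frac{\langle X \rangle - z_1}{\langle n \rangle}. \tag{2.72} \]

This gives the relation between the mean photon number and $\mu$. The eigenvalue is then

\[ \nu = 2 - \frac{|z_1| (\langle X \rangle - z_1)^2}{\langle n \rangle^2}. \tag{2.73} \]

From Eq. (2.64) it is clear that

\[ \langle 2\cos\phi \rangle - \mu\langle n \rangle = \nu. \tag{2.74} \]

Substituting the values for $\mu$ and $\nu$ from above gives

\[ \langle 2\cos\phi \rangle - \frac{(\langle X \rangle - z_1)^3}{\langle n \rangle^2} = 2 - \frac{|z_1| (\langle X \rangle - z_1)^2}{\langle n \rangle^2}. \tag{2.75} \]

Since $|z_1| = -z_1$, this simplifies to

\[ 2 - \langle 2\cos\phi \rangle = -\frac{\langle X \rangle (\langle X \rangle - z_1)^2}{\langle n \rangle^2}. \tag{2.76} \]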
Therefore the phase variance is

\[ V(\phi) \approx \frac{1.89360591826155}{\bar{n}^2}. \tag{2.77} \]

This is the result obtained in [37]. Therefore we see that the phase variance for optimum states scales as $\bar{n}^{-2}$, the same as for the case with an upper limit on the photon number. Note that it is accurate to approximate the Holevo phase variance by $2 - \langle 2\cos\phi \rangle$ here, as the differences will be of order $\bar{n}^{-4}$.

Figure 2.7: The phase variance multiplied by the square of the mean photon number for states optimised for minimum phase variance with the constraint of fixed mean photon number. The theoretical asymptotic value of 1.893606 is shown as the dash-dotted line.

This analytic result has been verified by determining the optimum states numerically. The eigenvalue problem was solved with various values of $\mu$, and for each value of $\mu$ the maximum eigenvalue was chosen, and the mean photon number was determined from the corresponding eigenstate. In order to make the problem finite, the state coefficients were only considered for photon numbers up to around 11 times the mean photon number of the state. At this point the state coefficients had fallen to around $10^{-16}$.

The results multiplied by $\bar{n}^2$ are plotted in Fig. 2.7. The results converge rapidly to the asymptotic value, with agreement within 1% of the asymptotic value for mean photon numbers above about 250. Note that it does not make sense to perform calculations with the continuous version of the eigenvalue equation, as this case was solved exactly.
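The numerical procedure just described can be sketched as follows. This is an illustrative reconstruction, not the original code: it builds the tridiagonal matrix of the recurrence relation (2.66) for a fixed $\mu$, takes the eigenvector with the largest eigenvalue, and evaluates the mean photon number and Holevo variance from it. The value of $\mu$, the matrix size and the function name are arbitrary choices made here so that the example runs quickly.

```python
import numpy as np

def optimised_canonical_state(mu, size):
    """Solve psi_{n+1} + psi_{n-1} - mu n psi_n = nu psi_n, truncated at `size` number states."""
    n = np.arange(size)
    ones = np.ones(size - 1)
    matrix = np.diag(-mu * n) + np.diag(ones, 1) + np.diag(ones, -1)
    eigenvalues, eigenvectors = np.linalg.eigh(matrix)   # ascending order
    nu = eigenvalues[-1]
    psi = eigenvectors[:, -1]                            # normalised, real
    mean_n = float(np.sum(n * psi ** 2))
    two_cos = 2.0 * float(np.sum(psi[:-1] * psi[1:]))    # <2 cos(phi)> for canonical measurements
    holevo = (two_cos / 2.0) ** -2 - 1.0
    return nu, mean_n, two_cos, holevo

mu = 2e-6
nu, mean_n, two_cos, holevo = optimised_canonical_state(mu, 1500)
print(mean_n, holevo * mean_n ** 2)   # the product should be near 1.8936 for large mean_n
```

The mean photon number here is only about 120, so the product is expected to sit a few percent away from the asymptotic value; smaller $\mu$ (with a proportionally larger cutoff) moves it closer.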



2.2.2 General Dyne Measurements 



Now I will consider the case of optimising for minimum phase variance with a fixed mean photon 
number for the case of more general dyne measurements. The complete derivation is original to 
this study, and is based on a partial derivation by Zhong-Xi Zhang (personal communication). This 
derivation was published in a brief form in j|7| . 

Similarly to the case of canonical measurements, the method of undetermined multipliers gives the equation

\[ \left[ 2\cos\phi - \mu N \right] |\psi\rangle = \nu|\psi\rangle. \tag{2.78} \]






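For the case of more general dyne measurements, we have

\[ 2\cos\phi = \sum_{n=0}^{\infty} \left[ |n\rangle\langle n+1| + |n+1\rangle\langle n| \right] (1 - h(n)). \tag{2.79} \]

If the state is expressed in the number state basis

\[ |\psi\rangle = \sum_{n=0}^{\infty} \psi_n |n\rangle, \tag{2.80} \]

then Eq. (2.78) can be expanded to obtain

\[
\begin{aligned}
\sum_{n=0}^{\infty} \psi_{n+1}|n\rangle + \sum_{n=1}^{\infty} \psi_{n-1}|n\rangle - \sum_{n=0}^{\infty} h(n)\psi_{n+1}|n\rangle - \sum_{n=1}^{\infty} h(n-1)\psi_{n-1}|n\rangle & \\
- \sum_{n=0}^{\infty} \mu\psi_n n|n\rangle &= \sum_{n=0}^{\infty} \nu\psi_n|n\rangle.
\end{aligned} \tag{2.81}
\]

This gives the recurrence relation

\[ \psi_{n+1} + \psi_{n-1} - h(n)\psi_{n+1} - h(n-1)\psi_{n-1} - \mu\psi_n n = \nu\psi_n. \tag{2.82} \]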



Similarly to the derivation for an upper limit on the photon number, we can treat $n$ as a continuous variable, and make the approximations

\[
\begin{aligned}
\psi(n+1) + \psi(n-1) &\approx \left( 2 + \frac{d^2}{dn^2} \right) \psi(n), \\
h(n)\psi(n+1) + h(n-1)\psi(n-1) &\approx 2h(n)\psi(n).
\end{aligned} \tag{2.83}
\]

Using these approximations, Eq. (2.82) becomes

\[ -\frac{d^2\psi(n)}{dn^2} + \psi(n) \left[ 2h(n) + \nu + \mu n - 2 \right] \approx 0. \tag{2.84} \]

Now define

\[ f(n) = 2h(n) + \nu + \mu n - 2. \tag{2.85} \]

To solve Eq. (2.84), it is convenient to expand $f(n)$ in a Taylor series around

\[ n_0 = \left( \frac{\mu}{2cp} \right)^{-1/(p+1)}. \tag{2.86} \]

The derivatives of $f(n)$ are

\[ f(n_0) = 2h(n_0) + \nu + \mu n_0 - 2, \tag{2.87} \]
\[ f'(n_0) = 2h'(n_0) + \mu = 0, \tag{2.88} \]
\[ f''(n_0) = 2h''(n_0) = 2cp(p+1) n_0^{-p-2}, \tag{2.89} \]
\[ f'''(n_0) = -2cp(p+1)(p+2) n_0^{-p-3}. \tag{2.90} \]

Note that this technique requires that the number distribution has its maximum near $n_0$. This is justified in the derivation in Appendix A.1.

Using the Taylor series for $f(n)$, and defining $f_0 = f(n_0)$, $f_2 = f''(n_0)/2$ and $f_3 = f'''(n_0)/6$, Eq. (2.84) becomes

\[ \left\{ -\frac{d^2}{dn^2} + \left[ (n - n_0)^2 f_2 + (n - n_0)^3 f_3 \right] \right\} \psi(n) \approx -f_0 \psi(n). \tag{2.91} \]
Note that $-f_0 = 2 - (2h(n_0) + \nu + \mu n_0)$, so the above equation is equivalent to solving (2.78) as an eigenvalue equation for $\nu$ with a fixed value of $\mu$. Now Eq. (2.91) is equivalent to the time-independent Schrödinger equation with energy eigenvalue

\[ E = -f_0, \tag{2.92} \]

for a perturbed harmonic Hamiltonian $H$. We can apply perturbation theory with

\[ H = H_0 + H_1, \tag{2.93} \]
\[ H_0 = -\frac{d^2}{dn^2} + f_2 (n - n_0)^2, \tag{2.94} \]
\[ H_1 = f_3 (n - n_0)^3. \tag{2.95} \]



This perturbation theory derivation is fairly lengthy, and the details are contained in Appendix A.1. It is shown that the energy eigenvalue of the unperturbed ground state is $\sqrt{f_2}$, and that the mean photon number for the perturbed state is

\[ \bar{n} \approx n_0 + \frac{p+2}{4\sqrt{cp(p+1)}} n_0^{p/2}. \tag{2.96} \]

Using this we can find the minimum phase variance based on

\[ \langle 2\cos\phi \rangle_{\max} = (\nu + \mu\bar{n})_{\max}. \tag{2.97} \]

This can be evaluated as

\[ \nu + \mu\bar{n} = 2 - 2h(\bar{n}) + f(\bar{n}). \tag{2.98} \]

Using a Taylor expansion for $f(\bar{n})$ gives

\[ \nu + \mu\bar{n} \approx 2 - 2h(\bar{n}) + f_0 + (\bar{n} - n_0)^2 f_2. \tag{2.99} \]

Using the result for $\bar{n}$ in Eq. (2.96), the last term is of order $\bar{n}^{-2}$, which is small enough to be omitted here.

It is shown in Appendix A.1 that the energy eigenvalue of the state corresponding to the smallest phase variance is $\sqrt{f_2}$. The correction found by perturbation theory is of order $\bar{n}^{-2}$, and can be omitted. As $-f_0$ is the energy eigenvalue, we get

\[
\begin{aligned}
\nu + \mu\bar{n} &\approx 2 - 2h(\bar{n}) - \sqrt{f_2} \\
&= 2 - \left[ 2c\bar{n}^{-p} + \sqrt{cp(p+1)}\, \bar{n}^{-p/2-1} \right].
\end{aligned} \tag{2.100}
\]

The Holevo phase variance is then given by

\[
\begin{aligned}
V(\phi) &= \left[ (\nu + \mu\bar{n})/2 \right]^{-2} - 1 \\
&\approx 2c\bar{n}^{-p} + \sqrt{cp(p+1)}\, \bar{n}^{-p/2-1} + 3c^2 \bar{n}^{-2p}.
\end{aligned} \tag{2.101}
\]



Note that the first term here is the same as the result when an upper limit is put on the photon number, but the second term scales as a different power of $\bar{n}$. Similarly to the case where there is an upper limit on the photon number, the phase variance should be found in this way rather than using $V(\phi) \approx 2 - (\nu + \mu\bar{n})$. This method gives the extra term $3c^2\bar{n}^{-2p}$, which is of lower order than the second term for mark I measurements. This term can be ignored in the other two cases, as it is of higher order.
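To make the scaling in Eq. (2.101) concrete, the sketch below evaluates the coefficient $\sqrt{cp(p+1)}$ of the second term and the three-term variance. The $(c, p)$ pairs are assumed values (heterodyne $(1/8, 1)$, mark I $(1/8, 1/2)$, mark II $(1/16, 3/2)$), consistent with the leading terms of $h(n)$ used earlier in the chapter; the function names are hypothetical.

```python
import math

SCHEMES = {"heterodyne": (1 / 8, 1.0), "mark I": (1 / 8, 0.5), "mark II": (1 / 16, 1.5)}

def second_coefficient(c, p):
    """Coefficient of the n^(-p/2 - 1) term in Eq. (2.101)."""
    return math.sqrt(c * p * (p + 1))

def holevo_variance(c, p, mean_n):
    """Three-term asymptotic variance of Eq. (2.101) for mean photon number mean_n."""
    return (2 * c * mean_n ** -p
            + second_coefficient(c, p) * mean_n ** (-p / 2 - 1)
            + 3 * c ** 2 * mean_n ** (-2 * p))

for name, (c, p) in SCHEMES.items():
    print(name, second_coefficient(c, p), holevo_variance(c, p, 1e4))
```

The coefficients come out as $1/2$, $\sqrt{3}/(4\sqrt{2})$ and $\sqrt{15}/8$, the asymptotic values of $z$ quoted for the three measurement schemes.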

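Using this relation, the Holevo phase variance for the three cases, heterodyne, mark I and mark II, should scale as

\[ V(\phi_{\rm het}) \approx \tfrac{1}{4} \bar{n}^{-1} + \tfrac{1}{2} \bar{n}^{-3/2}, \tag{2.102} \]
\[ V(\phi_{\rm I}) \approx \tfrac{1}{4} \bar{n}^{-1/2} + \tfrac{3}{64} \bar{n}^{-1} + \frac{\sqrt{3}}{4\sqrt{2}} \bar{n}^{-5/4}, \tag{2.103} \]
\[ V(\phi_{\rm II}) \approx \tfrac{1}{8} \bar{n}^{-3/2} + \frac{\sqrt{15}}{8} \bar{n}^{-7/4}. \tag{2.104} \]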
For mark I measurements there will also be a term of order $\bar{n}^{-1}$ due to the second term in the expansion for $h_{\rm I}(n)$. Taking this term into account the variance should be

\[ V(\phi_{\rm I}) \approx \tfrac{1}{4} \bar{n}^{-1/2} - 0.156248 \times \bar{n}^{-1} + \frac{\sqrt{3}}{4\sqrt{2}} \bar{n}^{-5/4}. \tag{2.105} \]



The case of heterodyne detection is of particular interest because it differs radically from the result claimed by D'Ariano and Paris of

\[ V(\phi_{\rm het}) = \frac{1.00 \pm 0.02}{\bar{n}^{1.30 \pm 0.02}}. \tag{2.106} \]

As the quoted errors suggest, this result was obtained entirely numerically, in contrast to the analytical result obtained here. Judging from the graphs given by D'Ariano and Paris, the numeric fit was performed for relatively small photon numbers, only up to about 100. As was found in previous sections, it generally takes very large photon numbers for the asymptotic scaling to become evident. In Sec. 2.2.3 I present numerical results that show that the analytic result in Eq. (2.102) is a far better fit than the power law of D'Ariano and Paris.



2.2.3 Numerical Results 

These analytic results have been verified by numerically calculating the optimum states for dyne measurements with a fixed mean photon number. In these calculations the values of $h(n)$ for heterodyne, mark I and mark II measurements were calculated using the formulae described in Sec. 2.1.3. The phase variances and photon numbers were determined in a similar way as in Sec. 2.2.1. Fixed values of $\mu$ were used, and for each value of $\mu$ the maximum eigenvalue was found, and the mean photon number was determined from the corresponding eigenvector.

Similarly to Sec. 2.2.1 a cutoff was used at a photon number sufficiently above the mean photon number that the state had fallen to around $10^{-16}$. In addition, calculations were performed using the continuous approximation in order to obtain results for very large photon numbers. The method used was similar to that used in Sec. 2.1.3. The phase variances and photon numbers were determined for 512, 1024 and 2048 intervals, and the result for the continuous case was then estimated by projecting to zero step size.

The results for the exact case and continuous approximation for heterodyne measurements are
shown in Fig. 2.8. The power law claimed by D'Ariano and Paris is also shown in this figure. As
can be seen, there is good agreement between the numerically calculated values and the asymptotic 
analytic expression for photon numbers above 100. There is also excellent agreement between the 
values calculated exactly and those calculated using the continuous approximation. 

Note that the power law of D'Ariano and Paris gives good agreement for photon numbers below 
100, better than the asymptotic analytic expression obtained here. For photon numbers above 100, 
however, this power law differs greatly from the numerically calculated values, demonstrating that 
this power law is not the correct asymptotic scaling. 

In order to see how accurately the numerical results agree with the second term in the asymptotic limit, we can define the parameter $z$ in a similar way as was defined in Sec. 2.1.3:

\[ z = \frac{V(\phi) - 2c\bar{n}^{-p}}{\bar{n}^{-p/2-1}}. \tag{2.107} \]

Provided the term $3c^2\bar{n}^{-2p}$ can be ignored, the asymptotic value of $z$ should be $\sqrt{cp(p+1)}$. This parameter is plotted in Fig. 2.9. Again there is excellent agreement between the values calculated exactly and the continuous approximation. There is still a small but significant difference between the numeric values and the asymptotic limit for a photon number of 100, but for photon numbers above $10^4$ there is better than 1% agreement with the asymptotic value.

The calculated results for mark I measurements are shown in Fig. 2.10. In this case the algorithm for calculating the continuous approximation did not give convergent results, so only the exact results are shown. There is good agreement with the asymptotic expression (2.103) for photon numbers above about 100. The asymptotic analytic expression taking account of the second term for $h_{\rm I}(n)$ is also shown in this figure. As can be seen, the numerical results agree with this expression even more accurately, with good agreement for photon numbers above about 10.

Figure 2.8: The minimum phase variance for heterodyne measurements on states with a fixed mean photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the asymptotic analytic expression as the continuous line. The power law claimed by D'Ariano and Paris for heterodyne detection is also plotted (dash-dotted line).

Figure 2.9: The value of $z$ for heterodyne measurements on states with a fixed mean photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the asymptotic value of 1/2 as the dash-dotted line. The results for squeezed states are shown as the continuous line.

Figure 2.10: The minimum phase variance for mark I measurements on states with a fixed mean photon number. The exact calculations are shown as the crosses and the asymptotic analytic expression as the continuous line. The analytic expression taking account of the second term for $h_{\rm I}(n)$ is shown as the dotted line.

For mark I measurements there is a term of order $\bar{n}^{-1}$, so we cannot expect $z$ to converge to $\sqrt{cp(p+1)}$. Using Eq. (2.105), we see that $z$ should converge to

\[ \sqrt{cp(p+1)} - 0.156248 \times \bar{n}^{1/4}. \tag{2.108} \]

The value of $z$ is plotted in Fig. 2.11, and as can be seen it converges reasonably accurately to this value, but not to $\sqrt{cp(p+1)}$.



The results for mark II measurements are shown in Fig. 2.12. There is very good agreement with the asymptotic analytic expression, and between the results calculated using the exact method and continuous approximation. If we plot $z$ (Fig. 2.13), we find that the second term does not agree well until very large photon numbers. In fact, a photon number greater than $7 \times 10^8$ is required to obtain better than 1% agreement. The agreement is again poor for the largest photon numbers; however, this is just due to poor convergence of the numerical technique.



2.3 Optimised Squeezed States 

As an alternative to optimising completely general states with a fixed mean photon number, we can 
restrict our attention to squeezed states. As mentioned in the introduction to this chapter, there 
are three reasons for this: 

1. Squeezed states are relatively easily generated in the laboratory, whereas there is no known way 
of producing general optimised states experimentally. 

2. Squeezed states can be treated numerically far more easily than general optimised states. 



3. It has been found numerically (see Sec. 2.3.4) that the phase uncertainties of optimised squeezed 
states are very close to those of optimised general states, and a partial theoretical explanation can 
be obtained by the following analysis. 
Figure 2.11: The value of $z$ for mark I measurements on states with a fixed mean photon number. The exact calculations are shown as crosses. The asymptotic value ignoring the second term in Eq. (2.105) is shown as the dash-dotted line, and the asymptotic expression taking this term into account is plotted as the dotted line. The results for squeezed states are shown as the continuous line.

Figure 2.12: The minimum phase variance for mark II measurements on states with a fixed mean photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the asymptotic analytic expression as the continuous line.

Figure 2.13: The value of $z$ for mark II measurements on states with a fixed mean photon number. The exact calculations are shown as the crosses, the continuous approximation as the circles, and the asymptotic value of $\sqrt{15}/8$ as the dash-dotted line. The results for squeezed states are shown as the continuous line.



Squeezed states can be described by just two parameters, the coherent amplitude and the squeez- 
ing parameter. When the mean photon number is fixed there is only one independent parameter, 
and the optimisation problem is reduced to function minimisation in one dimension. 

2.3.1 Canonical Measurements 



Again the simplest case is that of canonical measurements. This case was solved by Collett [45], and I will outline the derivation here. The squeezed states considered are of the form

\[ |\alpha, \zeta\rangle = \exp(\alpha a^\dagger - \alpha^* a) \exp[(\zeta^* a^2 - \zeta a^{\dagger 2})/2] |0\rangle. \tag{2.109} \]

The mean photon number for this state is given by

\[ \bar{n} = |\alpha|^2 + \sinh^2|\zeta|. \tag{2.110} \]

Using this relation we can take a fixed value of $\bar{n}$, and vary $\zeta$. The value of $\alpha$ can then be determined from $\bar{n}$ and $\zeta$.
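The reduction to a single free parameter can be written out explicitly. In this small sketch the function name is hypothetical and $\alpha$ is taken real and non-negative, which is an assumption justified only by the phase convention discussed next.

```python
import math

def coherent_amplitude(mean_n, zeta):
    """Real, non-negative alpha satisfying mean_n = alpha**2 + sinh(|zeta|)**2, Eq. (2.110)."""
    squeezing_photons = math.sinh(abs(zeta)) ** 2
    if squeezing_photons > mean_n:
        raise ValueError("squeezing alone exceeds the fixed mean photon number")
    return math.sqrt(mean_n - squeezing_photons)

# For fixed mean photon number, zeta is the only parameter left to optimise over.
for zeta in (0.0, -0.5, -1.0):
    print(zeta, coherent_amplitude(10.0, zeta))
```

The constraint also bounds the search range: $|\zeta|$ cannot exceed $\sinh^{-1}\sqrt{\bar{n}}$.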

One complication is the phases of $\alpha$ and $\zeta$. Only relative phase is important, so the phase of $\zeta$ can be considered relative to $\alpha$. Specifically, if the phase of $\alpha$ is rotated by $\theta$, an equivalent state is obtained by rotating the phase of $\zeta$ by $2\theta$. We can therefore take the phase of $\alpha$ to be zero without loss of generality.

All of the following analysis relies on $\zeta$ being real also. We can see that $\zeta$ should be real if we consider the phase quadrature diagram for the state. As in the introduction, the general quadrature $X_\Phi$ is given by

\[ X_\Phi = a e^{-i\Phi} + a^\dagger e^{i\Phi}. \tag{2.111} \]

The 0 and $\pi/2$ quadratures are therefore

\[ X_0 = (a + a^\dagger), \tag{2.112} \]
\[ X_{\pi/2} = -i(a - a^\dagger). \tag{2.113} \]
Figure 2.14: Contours of the probability distribution for $X_0$ and $X_{\pi/2}$. The contour for a coherent state is shown as the continuous line, and a contour for a squeezed state with $\phi_\zeta = \pi$ is shown as the dotted line.

For squeezed states the expectation values of these operators are

\[ \langle X_0 \rangle = 2|\alpha| \cos\varphi, \tag{2.114} \]
\[ \langle X_{\pi/2} \rangle = 2|\alpha| \sin\varphi, \tag{2.115} \]

where $\varphi$ is the phase of $\alpha$. In addition, the uncertainties of these quadrature operators are

\[ \langle \Delta X_0^2 \rangle = e^{2r} \sin^2(\phi_\zeta/2) + e^{-2r} \cos^2(\phi_\zeta/2), \tag{2.116} \]
\[ \langle \Delta X_{\pi/2}^2 \rangle = e^{2r} \cos^2(\phi_\zeta/2) + e^{-2r} \sin^2(\phi_\zeta/2), \tag{2.117} \]

where $r$ and $\phi_\zeta$ are the magnitude and phase of $\zeta$.

A common way of representing coherent and squeezed states is by a contour of the probability distribution for the measured values $X_0$ and $X_{\pi/2}$, as in Fig. 2.14. The contour for a coherent state is a circle; however, the contour for a squeezed state is an ellipse, with the major axis at an angle determined by $\phi_\zeta$.

If we are considering small deviations from zero phase,

\[ \langle X_0 \rangle \approx 2|\alpha|, \tag{2.118} \]
\[ \langle X_{\pi/2} \rangle \approx 2|\alpha| \varphi, \tag{2.119} \]

so measuring $X_0$ gives information about the photon number, whereas measuring $X_{\pi/2}$ gives information about the phase. If the actual phase is zero, then $\phi_\zeta$ should be equal to $\pi$ in order for $X_{\pi/2}$, and therefore the phase, to have a reduced uncertainty. This means that $\zeta$ should be real and negative in order to have a phase squeezed state.

A complex value of $\zeta$ will mean that the major axis of the ellipse is at an angle from the horizontal, and this should give a larger phase uncertainty than if the ellipse is horizontal. This is not a rigorous derivation, and I have found numerically (see Fig. 2.18) that for very small $\bar{n}$ the optimum $\zeta$ is positive. Even for these exceptional values of $\bar{n}$, complex values of $\zeta$ were never found to give a better result. It is therefore reasonable to assume a real value of $\zeta$ in the following analysis.

In order to determine the phase uncertainty, we wish to determine

\[ \langle\cos\phi\rangle = \frac{1}{2}\sum_{n=0}^{\infty}\left(\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle + \langle\alpha,\zeta|n+1\rangle\langle n|\alpha,\zeta\rangle\right). \tag{2.120} \]
Collett determines this sum using the number state representation of squeezed states as given by
\[ \langle n|\alpha,\zeta\rangle = (n!\mu)^{-1/2}(\nu/2\mu)^{n/2}H_n[\beta(2\mu\nu)^{-1/2}]\exp[-|\beta|^2/2 + (\nu^*/2\mu)\beta^2], \tag{2.121} \]

where

\[ \mu = \cosh r, \tag{2.122} \]
\[ \nu = e^{i\phi_\zeta}\sinh r, \tag{2.123} \]
\[ \beta = \alpha\mu + \alpha^*\nu. \tag{2.124} \]

The $H_n$ are Hermite polynomials and are given by [16]

\[ H_n(x) = n!\sum_{m=0}^{\lfloor n/2\rfloor}\frac{(-1)^m(2x)^{n-2m}}{m!(n-2m)!}. \tag{2.125} \]

Note that this formula is for the general case of complex $\alpha$ and $\zeta$.
Collett evaluated this for real $\alpha$ and real, negative $\zeta$, and found



cos (j) } « erf(V2ni) 



1 



(cothr — 1) 



32niV2e r sinh i 



1 



1 

2n[ 



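where

\[ n_1 = \tfrac{1}{2}\alpha^2(1 - \tanh r). \tag{2.127} \]
This in turn gives the phase variance as

\[ V(\phi) \approx \frac{n_0 + 1}{4\bar{n}^2} + 2\,\mathrm{erfc}(\sqrt{2n_0}), \tag{2.128} \]

where $n_0 = \bar{n}e^{-2r}$. Taking the derivative with respect to $n_0$ gives

\[ \frac{d}{dn_0}V(\phi) \approx \frac{1}{4\bar{n}^2} - \sqrt{\frac{8}{\pi n_0}}\,e^{-2n_0}. \tag{2.129} \]

This is zero for

\[ \frac{1}{4\bar{n}^2} = \sqrt{\frac{8}{\pi n_0}}\,e^{-2n_0}, \tag{2.130} \]

or

\[ n_0 = \log(4\bar{n}) - \tfrac{1}{4}\log(2\pi n_0). \tag{2.131} \]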

In Ref. [45] the factor of $n_0$ is omitted in the second term on the right hand side. We can make a
slightly better approximation by using $n_0 \approx \log(4\bar{n})$ on the right hand side, so

\[ n_0 \approx \log(4\bar{n}) - \tfrac{1}{4}\log\log(4\bar{n}) - \tfrac{1}{4}\log(2\pi). \tag{2.132} \]

We can also include an arbitrary number of additional terms with iterated logs; however, this
expression gives a very accurate solution, as shown in Fig. 2.17.
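
As an illustrative check that is not part of the original analysis, the implicit equation (2.131) can be solved by simple fixed-point iteration and compared with the closed-form approximation (2.132). The function names below are hypothetical, and the starting guess $n_0 = \log(4\bar{n})$ is the one suggested in the text.

```python
import math

def n0_fixed_point(nbar, iterations=100):
    """Solve n0 = log(4 nbar) - (1/4) log(2 pi n0), Eq. (2.131), by iteration."""
    n0 = math.log(4 * nbar)  # starting guess used in the text
    for _ in range(iterations):
        n0 = math.log(4 * nbar) - 0.25 * math.log(2 * math.pi * n0)
    return n0

def n0_approx(nbar):
    """Closed-form approximation, Eq. (2.132)."""
    log4n = math.log(4 * nbar)
    return log4n - 0.25 * math.log(log4n) - 0.25 * math.log(2 * math.pi)

if __name__ == "__main__":
    for nbar in (1e2, 1e4, 1e6, 1e8):
        print(nbar, n0_fixed_point(nbar), n0_approx(nbar))
```

The iteration converges quickly because the map has slope $-1/(4n_0)$, which is small in magnitude for the values of $n_0$ of interest here.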



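Using the asymptotic expansion of the complementary error function

\[ \mathrm{erfc}(z) \approx \frac{e^{-z^2}}{z\sqrt{\pi}}\sum_{k=0}^{\infty}(-1)^k\frac{1\cdot 3\cdots(2k-1)}{(2z^2)^k}, \tag{2.133} \]

we have, for large $n_0$,

\[ \mathrm{erfc}(\sqrt{2n_0}) \approx \frac{e^{-2n_0}}{\sqrt{2\pi n_0}}. \tag{2.134} \]
Using this result and substituting (2.132) into (2.128) gives the minimum phase variance as

\[ V(\phi) \approx \frac{\log\bar{n} - \frac{1}{4}\log\log(4\bar{n}) + \Delta}{4\bar{n}^2}, \tag{2.135} \]

where

\[ \Delta = \tfrac{3}{2} + 2\log 2 - \tfrac{1}{4}\log(2\pi) \approx 2.426825. \tag{2.136} \]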



Therefore we see that the squeezed state optimised for minimum intrinsic phase variance has a phase
variance scaling as $\log\bar{n}/\bar{n}^2$. This is not quite as good as the result for general optimised states,
which scale as $\bar{n}^{-2}$. The factor of $\log\bar{n}$ increases very slowly, however, and it requires very large
photon numbers to produce a significant difference between the variances of optimised squeezed and
general states.
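
The following sketch, which is only an illustration and not the method used for the numerical results in this chapter, minimises the approximate variance (2.128) over $n_0$ with a golden-section search and compares the minimum with the asymptotic form (2.135). The function names and the search interval are assumptions made for the example.

```python
import math

# Constant of Eq. (2.136)
DELTA = 1.5 + 2 * math.log(2) - 0.25 * math.log(2 * math.pi)

def variance_2_128(n0, nbar):
    """Approximate phase variance, Eq. (2.128), as a function of n0."""
    return (n0 + 1) / (4 * nbar**2) + 2 * math.erfc(math.sqrt(2 * n0))

def minimise_over_n0(nbar, lo=0.5, hi=60.0, steps=200):
    """Golden-section search for the n0 minimising Eq. (2.128).

    The interval [lo, hi] is an assumed bracket; the function has a single
    minimum there for the photon numbers used below.
    """
    g = (math.sqrt(5) - 1) / 2
    a, b = lo, hi
    for _ in range(steps):
        c, d = b - g * (b - a), a + g * (b - a)
        if variance_2_128(c, nbar) < variance_2_128(d, nbar):
            b = d
        else:
            a = c
    n0 = 0.5 * (a + b)
    return n0, variance_2_128(n0, nbar)

def variance_asymptotic(nbar):
    """Asymptotic minimum variance, Eq. (2.135)."""
    return (math.log(nbar) - 0.25 * math.log(math.log(4 * nbar)) + DELTA) / (4 * nbar**2)

if __name__ == "__main__":
    for nbar in (1e2, 1e4, 1e6):
        n0, vmin = minimise_over_n0(nbar)
        print(nbar, n0, vmin / variance_asymptotic(nbar))
```

The printed ratio indicates how closely (2.135) tracks the minimum of (2.128); it says nothing about the accuracy of (2.128) itself.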

Note that the expression (2.135) for the variance differs from that given in Ref. [45]. The $\frac{1}{4}\log\log(4\bar{n})$
term does not appear in Ref. [45], as the approximation $n_0 \approx 1$ is made on the right hand side of the
solution for $n_0$ (2.131). This term increases with photon number, and is therefore more significant
than the $\Delta$ term.



2.3.2 Numerical Results for Canonical Measurements 



This approximate analytic result has been tested numerically by determining the optimum squeezed
states for a range of mean photon numbers up to about $8\times 10^7$. Similarly to the case for optimum
general states, the state coefficients were considered until the point where they had fallen to about
$10^{-16}$, where they did not significantly affect the results.

The phase variances of the minimum uncertainty squeezed states are plotted in Fig. 2.15. Also
shown is the asymptotic expression (with the $\frac{1}{4}\log\log(4\bar{n})$ term omitted for simplicity). There is
good agreement between the calculated values and the analytic expression, with better than 1%
agreement for photon numbers above $5\times 10^4$. Also shown is the analytic expression if we omit $\Delta$,
and there is much poorer agreement for the smaller photon numbers in that case. This means that
although the $\Delta$ term is insignificant for large photon numbers, it gives more realistic results for
moderate photon numbers.

To demonstrate how close the phase variance is to that for general states, the ratio of the phase
variance for optimised squeezed states to that for general states is plotted in Fig. 2.16. For mean
photon numbers above about 10 the phase uncertainty for general states was estimated using the 
asymptotic expression, as this is very accurate for large photon numbers. The phase variances are 
extremely close for photon numbers up to 10 or 100, and it takes a photon number of almost a 
million for the phase variance for squeezed states to be more than twice that for optimised general 
states. 

In order to see if the term $\frac{1}{4}\log\log(4\bar{n})$ gives a better estimate of the phase variance, $4\bar{n}^2V(\phi) - \log\bar{n}$
is plotted in Fig. 2.17. According to the result (2.135), this should have the asymptotic value
$-\frac{1}{4}\log\log(4\bar{n}) + \Delta$, rather than $\Delta$. As can be seen, the exact values do not converge to $\Delta$, but
they do not appear to converge to $-\frac{1}{4}\log\log(4\bar{n}) + \Delta$ either. The exact values may converge to
$-\frac{1}{4}\log\log(4\bar{n}) + \Delta$ for much larger photon numbers; however, for the range of photon numbers
shown $\Delta$ gives a better approximation.

This result was checked by calculating the minimum phase variance exactly from (2.128), and
these results are also shown in Fig. 2.17. There is close agreement with $-\frac{1}{4}\log\log(4\bar{n}) + \Delta$, indicating
that the asymptotic expression (2.135) is a good approximation for (2.128). The discrepancy must
therefore be due to a slight inaccuracy in Eq. (2.128). In Ref. [45] Collett indicates that a correction
term for (2.128) is



2l | - 4„) A /^2- 2 "\ 



(2.137) 



where $a \approx 0.1990726$ and $n_1$ is as defined in Eq. (2.127). Although the results using this correction
agree with the exact values for very small photon numbers, and reproduce the peak found in the
exact results, the results again do not converge to the exact values for large photon numbers. In
fact the asymptotic expression simply using $\Delta$ is still the most accurate for photon numbers above
about $10^4$. As none of these corrections appear to give better results, I will omit them in further
discussion.

Figure 2.15: The phase variance for optimised squeezed states as a function of mean photon number.
The exact phase variance is shown as the crosses, the asymptotic expression $(\log\bar{n} + \Delta)/(4\bar{n}^2)$ is
shown as the continuous line, and $\log\bar{n}/(4\bar{n}^2)$ is shown as the dotted line.

Figure 2.16: The ratio of the phase variance for optimised squeezed states to the phase variance for
optimised general states.

Figure 2.17: The value of $4\bar{n}^2V(\phi) - \log\bar{n}$ as calculated using various methods. The exact values
are shown as crosses, $\Delta$ is shown as the horizontal dotted line and $-\frac{1}{4}\log\log(4\bar{n}) + \Delta$ is shown as
the continuous line. The values determined exactly using Eq. (2.128) are shown as the pluses, and
those calculated exactly using the additional correction are shown as the dash-dotted line.



Another test of the theory is the optimum value of $\zeta$. From Eq. (2.131) we have

\[ n_0 = \bar{n}e^{2\zeta} \approx \log(4\bar{n}) - \tfrac{1}{4}\log(2\pi), \tag{2.138} \]

where the factor of $n_0$ has been omitted from the second term. Solving for $\zeta$ gives

\[ \zeta = \tfrac{1}{2}\log\left(\log(4\bar{n}) - \tfrac{1}{4}\log(2\pi)\right) - \tfrac{1}{2}\log\bar{n}. \tag{2.139} \]

The numerically determined optimum values of $\zeta$ and the approximate analytic expression (2.139)
are plotted in Fig. 2.18. As can be seen, the numerically determined values are extremely close to
the analytic result for photon numbers above about 100.

Note also that for very small mean photon numbers, below around 2.6, the optimum value of $\zeta$
is actually positive. This is a strange result, as the theory given above indicates that $\zeta$ should be
negative for reduced phase uncertainty. The reason for this would seem to be that for cases where the
photon number is very small, the circular error contour for a coherent state, as shown in Fig. 2.19,
would overlap the $X_{\pi/2}$ axis. This means that there would be a significant contribution to the
probability distribution for $\phi = \pm\pi$. A squeezed state with negative $\zeta$ would have an error contour
that extends even further over the $X_{\pi/2}$ axis, resulting in a larger contribution to the probability
distribution near $\phi = \pm\pi$.

If $\zeta$ is positive, however, the major axis of the error ellipse would be vertical, so the ellipse would
avoid overlapping the $X_{\pi/2}$ axis. It is reasonable that this would result in a smaller phase variance.

Note that there is no corresponding justification for using a complex value of $\zeta$. Complex values
were also tested, but always gave greater phase uncertainties, even for the very low photon number
cases.

Figure 2.19: Contours of the probability distribution for $X_0$ and $X_{\pi/2}$. The contour for a coherent
state is shown as the continuous line, a contour for a squeezed state with negative $\zeta$ is shown as the
dotted line, and with positive $\zeta$ as the dashed line.

2.3.3 General Dyne Measurements

Now I will consider squeezed states optimised for minimum phase variance under more general dyne
measurements. Again we wish to find $\langle\cos\phi\rangle$, except now we have

\[ 2\widehat{\cos\phi} = \sum_{n=0}^{\infty}(1 - h(n))\left[|n\rangle\langle n+1| + |n+1\rangle\langle n|\right]. \tag{2.140} \]

The measure of the phase variance that I will initially consider is

\[ V'(\phi) = 2 - 2\langle\cos\phi\rangle. \tag{2.141} \]



I will use this rather than the Holevo variance initially, in order to use the results for canonical 
measurements in a straightforward way. This expression is not a sufficiently accurate estimate of 
the Holevo variance for the case of mark I measurements. Therefore, at the end of this derivation, 
the result for this measure of the phase variance will be converted to the Holevo variance. This 
measure of the phase variance can be used for the intermediate steps, as minimising this measure of 
the phase variance is equivalent to minimising the Holevo phase variance. 
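Evaluating this measure of the phase variance gives

\[ \begin{aligned} V'(\phi) &= 2 - \sum_{n=0}^{\infty}(1 - h(n))\left[\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle + \langle\alpha,\zeta|n+1\rangle\langle n|\alpha,\zeta\rangle\right] \\ &= 2 - \sum_{n=0}^{\infty}\left[\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle + \langle\alpha,\zeta|n+1\rangle\langle n|\alpha,\zeta\rangle\right] \\ &\quad + \sum_{n=0}^{\infty}h(n)\left[\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle + \langle\alpha,\zeta|n+1\rangle\langle n|\alpha,\zeta\rangle\right]. \end{aligned} \tag{2.142} \]

Collett [45] found the first two terms to be approximately

\[ \frac{n_0 + 1}{4\bar{n}^2} + 2\,\mathrm{erfc}(\sqrt{2n_0}), \tag{2.143} \]

so we therefore only require an expression for the third term. Both $\alpha$ and $\zeta$ will be taken to be real
in this derivation, so the number state coefficients of the squeezed state are real, and the third term
can be expressed as

\[ 2\sum_{n=0}^{\infty}h(n)\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle. \tag{2.144} \]

Approximating $h(n)$ by $cn^{-p}$, the sum to be evaluated is

\[ \sum_{n=0}^{\infty}cn^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle. \tag{2.145} \]

This expression can be evaluated approximately using

\[ \sum_{n=0}^{\infty}n^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle \approx \langle n^{-p}\rangle. \tag{2.146} \]

Expanding this in a series around $n = \bar{n}$ we find

\[ \langle n^{-p}\rangle = \langle(\bar{n} + \Delta n)^{-p}\rangle \approx \bar{n}^{-p}\left[1 + \frac{p(p+1)\langle\Delta n^2\rangle}{2\bar{n}^2}\right]. \tag{2.147} \]
It is easily shown that for squeezed states

\[ \langle\Delta n^2\rangle = \alpha^2(\mu - \nu)^2 + 2\mu^2\nu^2 \approx \frac{\bar{n}^2}{n_0}, \tag{2.148} \]

so we find

\[ \sum_{n=0}^{\infty}n^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle \approx \bar{n}^{-p}\left(1 + \frac{p(p+1)}{2n_0}\right). \tag{2.149} \]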



This should be correct to leading order; however, the second order term is not necessarily correct,
as the approximation (2.146) may only be correct to first order. This result is derived in a more
rigorous way in Appendix A.2, and it is shown that the second order term is also correct.
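Using this result, the phase uncertainty is given by

\[ V'(\phi) \approx \frac{n_0 + 1}{4\bar{n}^2} + 2\,\mathrm{erfc}(\sqrt{2n_0}) + 2c\bar{n}^{-p}\left[1 + \frac{p(p+1)}{2n_0}\right]. \tag{2.150} \]

Taking the derivative with respect to $n_0$ gives

\[ \frac{\partial}{\partial n_0}V'(\phi) \approx \frac{1}{4\bar{n}^2} - \sqrt{\frac{8}{\pi n_0}}\,e^{-2n_0} + 2c\bar{n}^{-p}\left[\frac{-p(p+1)}{2n_0^2}\right]. \tag{2.151} \]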



As the second term falls exponentially with $n_0$ it can be omitted. Then we find that for minimum
phase variance

\[ n_0 \approx 2\sqrt{cp(p+1)}\,\bar{n}^{1-p/2}. \tag{2.152} \]

For $0 < p < 2$, $n_0$ increases with photon number, but does not increase as rapidly as $\bar{n}$, in agreement
with the assumptions used in the appendix.

Substituting this result into Eq. (2.150) gives

\[ V'(\phi) \approx 2c\bar{n}^{-p} + \sqrt{cp(p+1)}\,\bar{n}^{-p/2-1}. \tag{2.153} \]
Converting this to the Holevo phase variance gives an additional correction term:

\[ V(\phi) \approx 2c\bar{n}^{-p} + \sqrt{cp(p+1)}\,\bar{n}^{-p/2-1} + 3c^2\bar{n}^{-2p}. \tag{2.154} \]

This correction term is only significant for mark I measurements, where it is of lower order than the
second term. Thus we see that we obtain exactly the same terms for the phase uncertainty when
considering squeezed states as we do when considering general states.
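
As a rough numerical illustration of this derivation (a sketch, not the calculation used for the results in this thesis), the code below evaluates (2.150) with the exponentially small erfc term dropped, and checks that (2.152) is its stationary point and that the minimum reproduces (2.153) up to the higher-order term $1/(4\bar{n}^2)$. The function names are hypothetical, and the values $c = 1/16$, $p = 3/2$ used in the example are an assumption for the mark II case, chosen to be consistent with the $\frac{1}{8}\bar{n}^{-1.5}$ introduced variance quoted for mark II measurements.

```python
import math

def v_prime(n0, nbar, c, p):
    """Eq. (2.150) with the exponentially small erfc term omitted."""
    return (n0 + 1) / (4 * nbar**2) + 2 * c * nbar**(-p) * (1 + p * (p + 1) / (2 * n0))

def n0_optimum(nbar, c, p):
    """Analytic optimum, Eq. (2.152)."""
    return 2 * math.sqrt(c * p * (p + 1)) * nbar**(1 - p / 2)

def v_prime_min(nbar, c, p):
    """Leading terms of the minimum variance, Eq. (2.153)."""
    return 2 * c * nbar**(-p) + math.sqrt(c * p * (p + 1)) * nbar**(-p / 2 - 1)

if __name__ == "__main__":
    nbar, c, p = 1e4, 1 / 16, 1.5  # assumed mark II parameters
    n0 = n0_optimum(nbar, c, p)
    print("n0 =", n0)
    print("V' at optimum =", v_prime(n0, nbar, c, p))
    print("Eq. (2.153)   =", v_prime_min(nbar, c, p))
```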



2.3.4 Numerical Results 

These results have been tested by numerically determining the optimum squeezed states for hetero- 
dyne and mark I and II measurements. Because the results are extremely close to those for optimised 
general states, rather than plotting the phase variance, I have plotted the phase variance as a ratio 
to the phase variance for optimised general states. 

The ratio of the minimum phase variance for squeezed states to that for general states using
heterodyne measurements is plotted in Fig. 2.20. The phase variance for optimum squeezed states
is never more than 0.3% above the phase variance for optimum general states, and for large photon
numbers the phase variances converge. In fact, for some photon numbers the squeezed state
variances are closer to the exact general state variances than those calculated using the continuous
approximation.

The results for mark I measurements are shown in Fig. 2.21. Here the squeezed state phase
variance is never more than about 0.6% above the phase variance for optimum general states, and
the phase variances again converge for large photon numbers. The results for mark II measurements
are shown in Fig. 2.22. Here the squeezed state phase variance can be more than 1% above the
general state phase variance, but the phase variances still converge for large photon numbers.

Figure 2.20: The ratio of the phase variance for squeezed states optimised for minimum phase
variance under heterodyne measurements to the phase variance for optimum general states. The
values using the exactly calculated general states are shown as the continuous line, and the values
using the continuous approximation for the general states are shown as the crosses.

Figure 2.21: The ratio of the phase variance for squeezed states optimised for minimum phase
variance under mark I measurements to the phase variance for optimum general states.

Figure 2.22: The ratio of the phase variance for squeezed states optimised for minimum phase
variance under mark II measurements to the phase variance for optimum general states. The values
using the exactly calculated general states are shown as the continuous line, and the values using
the continuous approximation for the general states are shown as the crosses.

Another way of comparing the results for squeezed states and general states is to plot the $z$
parameter defined by (2.107). This is displayed for heterodyne, mark I and mark II measurements
in Figs 2.9, 2.11 and 2.13. As can be seen, the results for optimum squeezed states and general
states are almost indistinguishable. This means that the phase variances for squeezed and general
states agree to better precision than just the first terms that I have derived here.

In order to compare the squeezed states with the states obtained by general optimisation in a
more direct way, the number state coefficients for the two cases are plotted in Fig. 2.23(a). This
is for the example of heterodyne measurements, and similar results are obtained for the adaptive
measurement schemes. The two states are extremely close, indicating that the optimum general
states may be converging to squeezed states.

On the other hand, if the state coefficients are plotted logarithmically, there are large differences
between the states [see Fig. 2.23(b)]. Although the states are fairly close near the peak, the tails
of the two states have different scalings. These differences persist even for very large mean photon
numbers.



Another test of the theory is the values of $\zeta$ for the optimum squeezed states. Using Eq. (2.152)
gives the optimum value of $\zeta$ as

\[ \zeta = -\frac{p}{4}\log\bar{n} + \frac{1}{2}\log\left[2\sqrt{cp(p+1)}\right]. \tag{2.155} \]

Using this the optimal values of $\zeta$ for heterodyne, mark I and mark II measurements should be

\[ \zeta_{\rm het} = -\tfrac{1}{4}\log\bar{n}, \tag{2.156} \]
\[ \zeta_{\rm I} = -\tfrac{1}{8}\log\bar{n} + \tfrac{1}{4}\log\tfrac{3}{16}, \tag{2.157} \]
\[ \zeta_{\rm II} = -\tfrac{3}{8}\log\bar{n} + \tfrac{1}{4}\log\tfrac{15}{16}. \tag{2.158} \]

The numerically found optimum values of $\zeta$ and these asymptotic expressions are plotted in Figs 2.24,
2.25 and 2.26. The values of $\zeta$ converge to the asymptotic analytic expressions in all three cases.

The value of $\zeta$ converges fairly rapidly for heterodyne and mark I measurements, with agreement
within 0.05 for photon numbers above around 100. The convergence is much weaker in the mark II
case, and a photon number around $10^5$ is required for this level of agreement.

Figure 2.23: The number state coefficients for optimum states for heterodyne measurements and a
mean photon number of about 2116. The continuous lines are for the optimised general state with
fixed mean photon number, and the dotted lines are for the optimised squeezed state. Plot (a) is a
linear plot, and (b) is a semi-log plot.

Figure 2.24: The optimum values of $\zeta$ for heterodyne phase measurements on squeezed states. The
numerically determined values are shown as crosses, and the analytic expression is shown as the
continuous line.

Figure 2.25: The optimum values of $\zeta$ for mark I phase measurements on squeezed states. The
numerically determined values are shown as crosses, and the analytic expression is shown as the
continuous line.

Figure 2.26: The optimum values of $\zeta$ for mark II phase measurements on squeezed states. The
numerically determined values are shown as crosses, and the analytic expression is shown as the
continuous line.
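
The asymptotic squeezing parameters for the three measurement schemes follow from the general expression $\zeta = -\frac{p}{4}\log\bar{n} + \frac{1}{2}\log[2\sqrt{cp(p+1)}]$. The short sketch below evaluates it and compares with the closed forms $-\frac{1}{4}\log\bar{n}$, $-\frac{1}{8}\log\bar{n} + \frac{1}{4}\log\frac{3}{16}$ and $-\frac{3}{8}\log\bar{n} + \frac{1}{4}\log\frac{15}{16}$. The $(c, p)$ pairs are assumptions: they are the values that reproduce these closed forms and the quoted introduced variances, and the names are hypothetical.

```python
import math

def zeta_optimum(nbar, c, p):
    """zeta = -(p/4) log(nbar) + (1/2) log(2 sqrt(c p (p+1)))."""
    return -0.25 * p * math.log(nbar) + 0.5 * math.log(2 * math.sqrt(c * p * (p + 1)))

# Assumed (c, p) for h(n) ~ c n^(-p) in each scheme
SCHEMES = {
    "heterodyne": (1 / 8, 1.0),
    "mark I": (1 / 16, 0.5),
    "mark II": (1 / 16, 1.5),
}

# Closed forms for the three schemes
CLOSED_FORMS = {
    "heterodyne": lambda n: -0.25 * math.log(n),
    "mark I": lambda n: -0.125 * math.log(n) + 0.25 * math.log(3 / 16),
    "mark II": lambda n: -0.375 * math.log(n) + 0.25 * math.log(15 / 16),
}

if __name__ == "__main__":
    nbar = 1e4
    for name, (c, p) in SCHEMES.items():
        print(name, zeta_optimum(nbar, c, p), CLOSED_FORMS[name](nbar))
```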



Chapter 3 

Optimum Adaptive Dyne 
Measurements 



Now I will consider the second area for improvement of dyne measurements: the measurement 
technique (as opposed to the input state). This includes both the local oscillator phase used and 
the final phase estimate. Recall that for a state with a mean photon number of $\bar{n}$, heterodyne
measurements introduce a phase variance of about $\frac{1}{4}\bar{n}^{-1}$. This is much larger than the minimum
intrinsic phase variance, which scales as $1.89\times\bar{n}^{-2}$. The mark II adaptive measurements introduced
in Ref. [35] give a great improvement on this, introducing a phase variance scaling as $\frac{1}{8}\bar{n}^{-1.5}$.

This scaling is still far worse than the scaling for minimum intrinsic phase variance. In [34] it
is shown that the minimum introduced phase variance is the same as for squeezed states optimised
for minimum intrinsic phase variance. As discussed in Sec. 2.3.1, this scaling is $\log\bar{n}/(4\bar{n}^2)$. Recall
that this is not quite as good as that for general optimised states, but it requires very large photon
numbers for the difference to be significant.

In this chapter I describe a feedback scheme that improves on the mark II adaptive scheme, and 
is in fact very close to this theoretical limit. It is therefore very close to being the best possible 
phase measurement scheme. It is also possible, under some circumstances, to surpass the theoretical 
limit. This is discussed in Sec. 3.< 



3.1 The Theoretical Limit 

In order to understand how to attain the theoretical limit, we must first understand the reason for
the theoretical limit. It can be shown that the probability to obtain the results $A$, $B$ from an
arbitrary (adaptive or non-adaptive) measurement is

\[ P(A,B)\,d^2A\,d^2B = \mathrm{Tr}[\rho F(A,B)]\,d^2A\,d^2B, \tag{3.1} \]

where $\rho$ is the state of the mode being measured. Here $F(A,B)$ is the POM (probability operator
measure) for the measurement, and is given by

\[ F(A,B) = Q(A,B)|\tilde\psi(A,B)\rangle\langle\tilde\psi(A,B)|, \tag{3.2} \]

where $Q(A,B)$ is what the probability distribution $P(A,B)$ would be if $\rho$ were the vacuum state
$|0\rangle\langle 0|$, and $|\tilde\psi(A,B)\rangle$ is an unnormalised ket defined by

\[ |\tilde\psi(A,B)\rangle = \exp\left(\tfrac{1}{2}B(a^\dagger)^2 + Aa^\dagger\right)|0\rangle. \tag{3.3} \]

This is proportional to a squeezed state:

\[ \exp\left(\tfrac{1}{2}B(a^\dagger)^2 + Aa^\dagger\right)|0\rangle = (1 - |B|^2)^{-1/4}\exp\left(\tfrac{1}{2}A\alpha^{P*}\right)|\alpha^P,\zeta^P\rangle, \tag{3.4} \]
where

\[ |\alpha^P,\zeta^P\rangle = \exp(\alpha^P a^\dagger - \alpha^{P*}a)\exp\left(\tfrac{1}{2}\zeta^{P*}a^2 - \tfrac{1}{2}\zeta^P(a^\dagger)^2\right)|0\rangle, \tag{3.5} \]
and the squeezing parameters are

\[ \alpha^P = \frac{A + BA^*}{1 - |B|^2}, \tag{3.6} \]
\[ \zeta^P = -\frac{B\,\mathrm{atanh}|B|}{|B|}. \tag{3.7} \]

Here I am using the superscript $P$ on the squeezing parameters to indicate that they are the
parameters of the squeezed state in the POM, and not of a physical state. The squeezing parameters
for the input state will be given as $\alpha$ and $\zeta$ with no superscripts. Note that this differs from the
notation in [35], where a superscript $S$ indicates the squeezing parameters for the input state, and
no superscript indicates the squeezing parameters for the squeezed state in the POM.
In terms of these the POM is given by

\[ F(A,B) = Q'(A,B)|\alpha^P,\zeta^P\rangle\langle\alpha^P,\zeta^P|, \tag{3.8} \]

where

\[ Q'(A,B) = Q(A,B)(1 - |B|^2)^{-1/2}\exp\left[\mathrm{Re}(A\alpha^{P*})\right]. \tag{3.9} \]
If the system state is pure, $\rho = |\psi\rangle\langle\psi|$ and the probability distribution is given by

\[ P(A,B) = Q'(A,B)|\langle\psi|\alpha^P,\zeta^P\rangle|^2. \tag{3.10} \]

The greatest overlap between the states, and therefore the highest probability, will be when $|\alpha^P,\zeta^P\rangle$
has the same phase as the input state. As all the information about the system from the measurement
record is contained in the variables $A$ and $B$, the most probable phase based on the measurement
record is

\[ \phi = \arg(\alpha^P) = \arg(A + BA^*). \tag{3.11} \]
If additional information is known about the system, it is possible to obtain a better phase estimate.
This is a far more complicated case, and will be considered in Sec. 3.8.

For an unbiased measurement scheme the probability distribution for this phase estimate resulting
from Eq. (3.10) depends entirely on the inner product between the two states, and not on $Q'(A,B)$.
To see this, note firstly that if the measurement is unbiased, the vacuum probability distribution
$Q(A,B)$ will be independent of the phase. Secondly, recall that for the squeezed state $|\alpha^P,\zeta^P\rangle$, if we
rotate the phase of $\alpha^P$ by some angle $\theta$, we can obtain an equivalent phase-shifted state by rotating
the phase of $\zeta^P$ by $2\theta$. This means that $\zeta^P(\alpha^{P*})^2$ is independent of the phase. From Eq. (3.7), this
means that $B(\alpha^{P*})^2$ is independent of the phase. Since

\[ A\alpha^{P*} = (\alpha^P - B\alpha^{P*})\alpha^{P*} = |\alpha^P|^2 - B(\alpha^{P*})^2, \tag{3.12} \]

$A\alpha^{P*}$ and therefore $Q'(A,B)$ are independent of the phase.
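
A small numerical sketch may make this phase covariance concrete. It assumes the relations $\alpha^P = (A + BA^*)/(1 - |B|^2)$ and $\zeta^P = -B\,\mathrm{atanh}|B|/|B|$ of Eqs. (3.6) and (3.7); the function name and the sample values of $A$, $B$ and the rotation angle are hypothetical and chosen only for illustration.

```python
import cmath
import math

def pom_parameters(A, B):
    """Squeezing parameters (alpha^P, zeta^P) of the POM state from A and B."""
    alpha_p = (A + B * A.conjugate()) / (1 - abs(B) ** 2)
    zeta_p = -B * math.atanh(abs(B)) / abs(B) if B != 0 else 0j
    return alpha_p, zeta_p

if __name__ == "__main__":
    A, B = complex(0.8, 0.3), complex(0.5, -0.2)   # arbitrary sample values
    alpha_p, zeta_p = pom_parameters(A, B)

    # Rotate the measurement record: A -> A e^{i theta}, B -> B e^{2 i theta}
    theta = 0.7
    rot = cmath.exp(1j * theta)
    alpha_rot, zeta_rot = pom_parameters(A * rot, B * rot**2)

    print("phase estimate shift:", cmath.phase(alpha_rot / alpha_p))   # equals theta
    print("A alpha^P* before   :", A * alpha_p.conjugate())
    print("A alpha^P* after    :", (A * rot) * alpha_rot.conjugate())  # unchanged
```

Rotating $A$ by $\theta$ and $B$ by $2\theta$ shifts the phase estimate $\arg(\alpha^P)$ by $\theta$ while leaving $A\alpha^{P*}$ unchanged, which is the property used in the argument above.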

Since the probability distribution for the phase estimate depends on the inner product between
the two states, the variance in the measured phase will approximately be the sum of the intrinsic
phase variance and the phase variance of the squeezed state $|\alpha^P,\zeta^P\rangle$. The maximum overlap between
the states will be when the squeezed state has about the same photon number as the input state.
This means that the theoretical limit to the phase variance that is introduced by the measurement
is the phase variance of the squeezed state that has the same photon number as the input state and
has been optimised for minimum intrinsic phase variance. Since the phase variance of a squeezed
state optimised for minimum intrinsic phase variance is approximately $\log\bar{n}/(4\bar{n}^2)$ in the limit of
large $\bar{n}$ [45], this is also the limit to the introduced phase variance.

The photon number of the squeezed state at maximum overlap will be mainly determined by
the photon number of the input, but the degree and direction of squeezing (parametrised by $\xi^P$)
will be determined by the multiplying factor $Q'(A,B)$. The multiplying factor can be expressed
as a function of $\bar{n}^P$ and $\xi^P$, for which the same symbol $Q'$ will be used, even though it is a new
function $Q'(\bar{n}^P,\xi^P)$. Here $\bar{n}^P$ is the mean photon number of the state $|\alpha^P,\zeta^P\rangle$ (and will be close to
the photon number of the input state), and $\xi^P = \zeta^P\alpha^{P*}/\alpha^P$ is $\zeta^P$ with the phase of $\alpha^P$ scaled out.
In practice the multiplying factor tends to be concentrated along a particular line (for example, see
Fig. 3.21), effectively giving $\xi^P$ as a function of $\bar{n}^P$. In order to obtain the theoretical limit, the
measurement scheme must give a multiplying factor $Q'(\bar{n}^P,\xi^P)$ that gives values of $\xi^P$ for each $\bar{n}^P$
that are the same as for optimised squeezed states.

We can determine the approximate variation of $\xi^P$ with $\bar{n}^P$ in the multiplying factor if we can
estimate how it varies for measurements on a coherent state. If the intermediate phase estimates
used in the adaptive scheme are unbiased, it is easy to see that the maximum probability will be for
$B$ real and therefore also $A$ real. These results imply that

\[ \alpha^P = \frac{A(1 + B)}{1 - B^2} = \frac{A}{1 - B}. \tag{3.13} \]

This means that $B$ should be

\[ B \approx 1 - \frac{A}{\alpha^P}, \tag{3.14} \]

so $\xi^P$ should be

\[ \xi^P \approx -\mathrm{atanh}\left(1 - \frac{A}{\alpha^P}\right). \tag{3.15} \]

Note that when $\alpha^P$ is real, $\zeta^P$ and $\xi^P$ are equivalent. Provided the photon number is large, $B$ should
be close to 1 so we can use the asymptotic approximation of the atanh function, giving

\[ \xi^P \approx -\frac{1}{2}\log\frac{2\alpha^P}{A}. \tag{3.16} \]
The mean photon number for squeezed states is given by

\[ \bar{n}^P = |\alpha^P|^2 + \sinh^2|\zeta^P|. \tag{3.17} \]

For the states that are considered in this study, $\sinh^2|\zeta^P|$ is much smaller than $\bar{n}^P$ for large photon
numbers, so $\alpha^P \approx \sqrt{\bar{n}^P}$. Using this approximation gives $\zeta^P$ as

\[ \zeta^P \approx -\frac{1}{2}\log\frac{2\sqrt{\bar{n}^P}}{A}. \tag{3.18} \]

Since the magnitude of $\xi^P$ is governed by the multiplying factor $Q'(\bar{n}^P,\xi^P)$, this result for $\xi^P$ should
hold for more general input states.

From Sec. 2.3.1 and Ref. [45] the intrinsic phase variance of a squeezed state is given
approximately by

\[ V(\phi) \approx \frac{n_0 + 1}{4\bar{n}^2} + 2\,\mathrm{erfc}(\sqrt{2n_0}), \tag{3.19} \]

where $n_0 = \bar{n}e^{2\zeta}$ and $\zeta$ is real. This is minimised for

\[ n_0 \approx \log(4\bar{n}) - \tfrac{1}{4}\log(2\pi). \tag{3.20} \]

Using the result obtained for $\zeta$ in Eq. (3.18) gives
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so 



(3.22) 



As the optimum value of $n_0$ is given by Eq. (3.20), in order for the measurement to be optimal, $|A|$
should scale with $\bar{n}^P$ as

\[ |A| \propto \frac{\log\bar{n}^P}{\sqrt{\bar{n}^P}}. \tag{3.23} \]

Since $\bar{n}^P \approx \bar{n}$, we should get the same scaling with the input photon number. For the case of mark
II measurements there is the result that $|A| = 1$ [35], which is why these measurements are not
optimal. Note that if we substitute $|A| = 1$ into the expression (3.22) to find $n_0$, and substitute
that into Eq. (3.19), we obtain the correct result for the mark II introduced phase variance,

\[ \Delta V(\phi) \approx \tfrac{1}{8}\bar{n}^{-1.5}. \tag{3.24} \]
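
The mark II result can be checked with a few lines of arithmetic. The sketch below is an illustration under stated assumptions: it takes $n_0 = \bar{n}e^{2\zeta}$ with $\zeta = -\frac{1}{2}\log(2\sqrt{\bar{n}}/|A|)$, which gives $n_0 = |A|\sqrt{\bar{n}}/2$, and inserts this into $V(\phi) \approx (n_0+1)/(4\bar{n}^2) + 2\,\mathrm{erfc}(\sqrt{2n_0})$. The function names are hypothetical, and the input photon number is used in place of $\bar{n}^P$.

```python
import math

def n0_from_A(A_abs, nbar):
    """n0 = nbar exp(2 zeta) with zeta = -(1/2) log(2 sqrt(nbar) / |A|)."""
    zeta = -0.5 * math.log(2 * math.sqrt(nbar) / A_abs)
    return nbar * math.exp(2 * zeta)

def introduced_variance(A_abs, nbar):
    """Squeezed-state variance evaluated at the n0 implied by |A|."""
    n0 = n0_from_A(A_abs, nbar)
    return (n0 + 1) / (4 * nbar**2) + 2 * math.erfc(math.sqrt(2 * n0))

def optimal_A(nbar):
    """|A| for which n0 takes the optimum value log(4 nbar) - (1/4) log(2 pi)."""
    n0_opt = math.log(4 * nbar) - 0.25 * math.log(2 * math.pi)
    return 2 * n0_opt / math.sqrt(nbar)

if __name__ == "__main__":
    nbar = 1e6
    print("mark II (|A| = 1):", introduced_variance(1.0, nbar))
    print("(1/8) nbar^-1.5  :", 0.125 * nbar**-1.5)
    print("optimal |A|      :", optimal_A(nbar))
    print("variance there   :", introduced_variance(optimal_A(nbar), nbar))
```

The optimal $|A|$ found this way decreases with photon number roughly as $\log\bar{n}/\sqrt{\bar{n}}$, in line with the scaling argument above.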



3.2 Improved Feedback 

Now we have the result that for optimal feedback $|A|$ should decrease with photon number. Therefore,
in order to improve the phase measurement scheme, we want one that gives $|A| < 1$. To see in general
how this can be achieved, consider a coherent state with amplitude $\alpha$. The Ito SDE for $|A_v|^2$ will
be

\[ \begin{aligned} d|A_v|^2 &= A_v^*(dA_v) + (dA_v^*)A_v + (dA_v^*)(dA_v) \\ &= A_v^*e^{i\Phi(v)}I(v)dv + e^{-i\Phi(v)}I(v)dv\,A_v + dv \\ &= \left[|A_v|I(v)\left(e^{-i\varphi_v^A}e^{i\Phi(v)} + e^{-i\Phi(v)}e^{i\varphi_v^A}\right) + 1\right]dv, \end{aligned} \tag{3.25} \]

where $\varphi_v^A = \arg A_v$. Usually the local oscillator phase will be based on a phase estimate $\hat\varphi_v$, so that
$\Phi(v) = \hat\varphi_v + \pi/2$. In terms of this phase estimate the differential equation becomes

\[ \begin{aligned} d|A_v|^2 &= \left[|A_v|I(v)\left(ie^{-i\varphi_v^A}e^{i\hat\varphi_v} - ie^{i\varphi_v^A}e^{-i\hat\varphi_v}\right) + 1\right]dv \\ &= \left[1 + 2|A_v|I(v)\sin(\varphi_v^A - \hat\varphi_v)\right]dv. \end{aligned} \tag{3.26} \]
Taking the expectation value of $I(v)$ and simplifying gives

\[ \begin{aligned} \langle I(v)\rangle &= \left\langle 2\mathrm{Re}\left(\alpha e^{-i\Phi(v)}\right)dv + dW(v)\right\rangle/dv \\ &= -2|\alpha|\sin(\hat\varphi_v - \varphi), \end{aligned} \tag{3.27} \]

where $\varphi = \arg\alpha$. Using this result, the expectation value for the increment in $|A_v|^2$ is

\[ \langle d|A_v|^2\rangle = \left[1 - 4|A_v||\alpha|\sin(\hat\varphi_v - \varphi)\sin(\varphi_v^A - \hat\varphi_v)\right]dv. \tag{3.28} \]

Note that here the average is only over the single increment $dW(v)$, and not over the different
trajectories.

The first term on its own will give $|A| = 1$, and in order to get $|A| < 1$ the two sines must have
the same sign. This will be the case if the phase estimate is between the actual phase and the phase
of $A_v$. Therefore we would like to use the phase estimate

\[ \hat\varphi(v) = \arg\left(\alpha^{1-\varepsilon}A_v^{\varepsilon}\right), \tag{3.29} \]

where $0 < \varepsilon < 1$. The problem with this phase estimate is that it uses the actual value of the phase.
In order to avoid this problem, the best estimate of the phase can be used in place of the actual
phase. Therefore the phase estimate that will be considered is

\[ \hat\varphi(v) = \arg\left(C_v^{1-\varepsilon}A_v^{\varepsilon}\right), \tag{3.30} \]
where

\[ C_v = A_v v + B_v A_v^*. \tag{3.31} \]



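It would at first appear that the best value of $\varepsilon$ to use is 1/2, as this will make the phase estimate
exactly halfway between the best phase estimate and $\arg A_v$, and this was the initial improvement
on mark II feedback that was tried. This is too simplistic, however, as will be shown. In order to
estimate what the best values of $\varepsilon$ to use are, we can take the actual phase to be zero, and use the
phase estimate

\[ \hat\varphi(v) = \varepsilon\arg A_v. \tag{3.32} \]

For simplicity $\varepsilon$ will be taken to be constant. For this value of the phase estimate, $\varphi_v^A = \hat\varphi_v/\varepsilon$.
Substituting this into Eq. (3.28) gives

\[ \langle d|A_v|^2\rangle = \left[1 - 4|A_v|\alpha\sin(\hat\varphi_v)\sin[\hat\varphi_v(1/\varepsilon - 1)]\right]dv. \tag{3.33} \]

The absolute value symbols on $\alpha$ have been omitted because the phase has been taken to be zero.
When the phase estimate is close to zero we can use the linear approximation of the sine function
to get

\[ \langle d|A_v|^2\rangle = \left[1 - 4|A_v|\alpha\hat\varphi_v^2(1/\varepsilon - 1)\right]dv. \tag{3.34} \]
A differential equation for $\hat\varphi_v$ can be obtained in a similar way:

\[ \begin{aligned} d\hat\varphi_v &= \varepsilon\,\mathrm{Im}\left[d\ln A_v\right] \\ &= \varepsilon\,\mathrm{Im}\left[\frac{dA_v}{A_v} - \frac{(dA_v)^2}{2A_v^2}\right] \\ &= \varepsilon\,\mathrm{Im}\left[\frac{ie^{i\hat\varphi_v}I(v)dv}{|A_v|e^{i\hat\varphi_v/\varepsilon}} - \frac{\left(ie^{i\hat\varphi_v}I(v)dv\right)^2}{2\left(|A_v|e^{i\hat\varphi_v/\varepsilon}\right)^2}\right]. \end{aligned} \tag{3.35} \]

Note that here $A_v = |A_v|e^{i\hat\varphi_v/\varepsilon}$ has been used. Continuing the derivation we find

\[ \begin{aligned} d\hat\varphi_v &= \varepsilon\,\mathrm{Im}\left[\frac{ie^{i\hat\varphi_v(1-1/\varepsilon)}I(v)dv}{|A_v|} + \frac{e^{2i\hat\varphi_v(1-1/\varepsilon)}(I(v)dv)^2}{2|A_v|^2}\right] \\ &= \frac{\varepsilon}{|A_v|}\cos[\hat\varphi_v(1 - 1/\varepsilon)]I(v)dv + \frac{\varepsilon}{2|A_v|^2}\sin[2\hat\varphi_v(1 - 1/\varepsilon)]dv \\ &= \frac{\varepsilon}{|A_v|}\cos[\hat\varphi_v(1 - 1/\varepsilon)]\left[-2\alpha\sin\hat\varphi_v\,dv + dW(v)\right] + \frac{\varepsilon}{2|A_v|^2}\sin[2\hat\varphi_v(1 - 1/\varepsilon)]dv \\ &= \frac{\varepsilon}{|A_v|}\left\{-2\alpha\sin\hat\varphi_v\cos[\hat\varphi_v(1 - 1/\varepsilon)]dv + \frac{\sin[2\hat\varphi_v(1 - 1/\varepsilon)]dv}{2|A_v|} + \cos[\hat\varphi_v(1 - 1/\varepsilon)]dW(v)\right\} \\ &\approx \frac{\varepsilon}{|A_v|}\left[-2\alpha\hat\varphi_v dv + \frac{\hat\varphi_v(1 - 1/\varepsilon)dv}{|A_v|} + dW(v)\right]. \end{aligned} \tag{3.36} \]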
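In the last line the linear approximation for sine and a constant approximation for cos have been
used. Evaluating the increment in the square of the phase gives

\[ \begin{aligned} \langle d\hat\varphi_v^2\rangle &= \langle 2\hat\varphi_v d\hat\varphi_v + (d\hat\varphi_v)^2\rangle \\ &= \frac{2\varepsilon}{|A_v|}\left[-2\alpha\hat\varphi_v^2 + \frac{\hat\varphi_v^2(1 - 1/\varepsilon)}{|A_v|}\right]dv + \frac{\varepsilon^2}{|A_v|^2}dv \\ &= \frac{\varepsilon}{|A_v|}\left[-4\alpha\hat\varphi_v^2 + \frac{2\hat\varphi_v^2(1 - 1/\varepsilon)}{|A_v|} + \frac{\varepsilon}{|A_v|}\right]dv. \end{aligned} \tag{3.37} \]

Thus the expectation values for the increments in $|A_v|^2$ and $\hat\varphi_v^2$ are:

\[ \langle d|A_v|^2\rangle = \left[1 - 4\alpha|A_v|\hat\varphi_v^2(1/\varepsilon - 1)\right]dv, \tag{3.38} \]
\[ \langle d\hat\varphi_v^2\rangle = \frac{\varepsilon}{|A_v|}\left[-4\alpha\hat\varphi_v^2 + \frac{2\hat\varphi_v^2(1 - 1/\varepsilon)}{|A_v|} + \frac{\varepsilon}{|A_v|}\right]dv. \tag{3.39} \]
Now an approximate solution for $|A_v|^2$ and $\hat\varphi_v^2$ can be obtained by solving these as differential
equations, ignoring the fact that these are only the expectation values for the increments. The
approximate solution for large photon number is

\[ |A_v|^2 = \varepsilon v + O(\alpha^{-1}), \tag{3.40} \]
\[ \hat\varphi_v^2 = \frac{1}{4\alpha}\sqrt{\frac{\varepsilon}{v}} + O(\alpha^{-2}). \tag{3.41} \]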

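To see that these are solutions, we can just substitute them into the above equations. For Eq. (3.38)
we have

\[ \mathrm{l.h.s.} = d|A_v|^2 = \left[\varepsilon + O(\alpha^{-1})\right]dv, \tag{3.42} \]
\[ \begin{aligned} \mathrm{r.h.s.} &= \left[1 - 4\alpha\left(\sqrt{\varepsilon v} + O(\alpha^{-1})\right)\left(\frac{1}{4\alpha}\sqrt{\frac{\varepsilon}{v}} + O(\alpha^{-2})\right)(1/\varepsilon - 1)\right]dv \\ &= \left[1 - \left(\sqrt{\varepsilon v} + O(\alpha^{-1})\right)\left(\sqrt{\frac{\varepsilon}{v}} + O(\alpha^{-1})\right)(1/\varepsilon - 1)\right]dv \\ &= \left[1 - \left(\varepsilon + O(\alpha^{-1})\right)(1/\varepsilon - 1)\right]dv \\ &= \left[1 - \left(1 - \varepsilon + O(\alpha^{-1})\right)\right]dv \\ &= \left[\varepsilon + O(\alpha^{-1})\right]dv. \end{aligned} \tag{3.43} \]

For Eq. (3.39) we find

\[ \mathrm{l.h.s.} = O(\alpha^{-1})dv, \tag{3.44} \]
\[ \begin{aligned} \mathrm{r.h.s.} &= \frac{\varepsilon}{\sqrt{\varepsilon v} + O(\alpha^{-1})}\left[-4\alpha\left(\frac{1}{4\alpha}\sqrt{\frac{\varepsilon}{v}} + O(\alpha^{-2})\right) + \frac{2\left(\frac{1}{4\alpha}\sqrt{\frac{\varepsilon}{v}} + O(\alpha^{-2})\right)(1 - 1/\varepsilon)}{\sqrt{\varepsilon v} + O(\alpha^{-1})} + \frac{\varepsilon}{\sqrt{\varepsilon v} + O(\alpha^{-1})}\right]dv \\ &= \left(\sqrt{\frac{\varepsilon}{v}} + O(\alpha^{-1})\right)\left[-\left(\sqrt{\frac{\varepsilon}{v}} + O(\alpha^{-1})\right) + O(\alpha^{-1}) + \left(\sqrt{\frac{\varepsilon}{v}} + O(\alpha^{-1})\right)\right]dv \\ &= O(\alpha^{-1})dv. \end{aligned} \tag{3.45} \]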



For this equation it is enough that the sides agree to order a -1 . To obtain more specific agreement 
the higher order terms in the solution would be required. 

As a simple check on these results, note that the case $\varepsilon = 1$ is just the standard case and the final phase estimate corresponds to the mark I phase estimate. In this case the solutions give to first order

$$|A_v|^2 = v, \tag{3.46}$$

$$\varphi_v^2 = \frac{1}{4\alpha\sqrt{v}}. \tag{3.47}$$

These are exactly the same results as obtained in Ref. [35].

The important result is that for the final value of $A_v$, at $v = 1$, we have $|A| \approx \sqrt{\varepsilon}$. This indicates that we can obtain smaller and smaller values of $|A|$ simply by using smaller values of $\varepsilon$, and the minimum is not for $\varepsilon = 1/2$. To see the reason for this, note that
Figure 3.1: The mean values of $|A|^2$ divided by $\varepsilon$, plotted against $\alpha$, for measurements on coherent states with $\varepsilon \arg A_v$ feedback. The results for $\varepsilon = 1/2$ are shown in dark blue, for $\varepsilon = 1/4$ in green, for $\varepsilon = 1/8$ in red, for $\varepsilon = 1/16$ in light blue, for $\varepsilon = 1/32$ in purple, for $\varepsilon = 1/64$ in yellow, for $\varepsilon = 1/128$ in black, for $\varepsilon = 1/256$ as a dashed dark blue line, and for $\varepsilon = 1/512$ as a dashed green line.



This means that for smaller values of $\varepsilon$, the value of ip^ will tend to be greater. This means that the second sine in (3.28) will be greater on average, resulting in a smaller value of $|A|$.

For other values of $\varepsilon$ these results have been verified by numerically performing the stochastic integrals for coherent states. The numerical technique is very straightforward for coherent states. We simply replace the infinitesimal interval $dv$ with a finite interval $\delta v$, and replace the Wiener increment $dW$ with a finite stochastic increment $\delta W$ that has a Gaussian distribution with zero mean and variance $\delta v$. For these calculations $100\alpha$ time steps and $2^{10}$ samples were used.
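The finite-step procedure described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used for the thesis results: it assumes a real coherent amplitude (system phase zero), takes the feedback phase as $\varphi_v = \varepsilon \arg A_v$ with $\Phi = \varphi_v + \pi/2$, and the function name `simulate_eps_arg_feedback` and the parameter values are hypothetical.

```python
import numpy as np

def simulate_eps_arg_feedback(alpha, eps, n_steps, n_samples, seed=0):
    """Euler-Maruyama integration of dA_v = exp(i*Phi) I(v) dv over v in [0, 1).

    Assumes a coherent state with real amplitude alpha (system phase zero) and
    the feedback phase phi_v = eps * arg(A_v), with Phi = phi_v + pi/2.
    Returns the final complex value of A for every sample.
    """
    rng = np.random.default_rng(seed)
    dv = 1.0 / n_steps
    A = np.zeros(n_samples, dtype=complex)
    for _ in range(n_steps):
        phi = eps * np.angle(A)            # np.angle(0) is 0, so the first step uses phi = 0
        Phi = phi + np.pi / 2              # local oscillator phase
        dW = rng.normal(0.0, np.sqrt(dv), size=n_samples)  # finite Wiener increment
        I_dv = 2.0 * np.real(alpha * np.exp(-1j * Phi)) * dv + dW
        A = A + np.exp(1j * Phi) * I_dv
    return A

# Illustrative run: eps = 1 is the mark I case, for which |A|^2 should be close to 1 at v = 1.
A_final = simulate_eps_arg_feedback(alpha=10.0, eps=1.0, n_steps=1000, n_samples=256)
mean_abs_A_sq = float(np.mean(np.abs(A_final) ** 2))
```

The final phase estimate for each sample would be `eps * np.angle(A_final)`; averaging its square over samples gives a numerical estimate to compare with the approximate solution above.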



The results for $|A|^2$ are shown in Fig. 3.1. For simplicity, rather than plotting $|A|^2$, I have shown $|A|^2/\varepsilon$. From the theory above this should converge to 1. The results for moderate values of $\varepsilon$ agree very well with the theory; however, for smaller values of $\varepsilon$ there is poor agreement unless the photon number is very large. For $\varepsilon = 1/64$, a value of $\alpha$ above about 512 is required for good agreement. The photon number required for good agreement with theory appears to scale roughly as $\varepsilon^{-3}$.

In Fig. 3.2 are shown the results for the final variance of the phase estimate. For simplicity, $\varphi^2/\sqrt{\varepsilon}$ is plotted. From the theory above this should be approximately $1/(4\alpha)$. Similarly to the case for $|A|^2$, the results for $\varepsilon = 1/2$ agree very well, but those for smaller values of $\varepsilon$ require large photon numbers in order to have good agreement.

The next question is whether this good agreement continues if the best phase estimate is used in the feedback, rather than the actual phase, as in Eq. (3.30). The calculations were repeated for this feedback, and the results for $|A|^2$ are shown in Fig. 3.3. The results are very similar to the previous case, except that higher photon numbers than before are required to obtain good agreement. In this case the photon number required for good agreement with theory appears to scale around $\varepsilon^{-2}$.

These results indicate that $|A|$ can be reduced to any value required, provided the photon number is sufficiently large. This implies that the results should be close to optimum for the feedback given by Eq. (3.30), with $\varepsilon$ given by

$$\varepsilon = \left(2\alpha_{\rm opt}\, e^{2\zeta_{\rm opt}}\right)^2, \tag{3.49}$$

where $\zeta_{\rm opt}$ is the value of $\zeta$ optimised for minimum intrinsic phase variance for the photon number $\bar{n}$. Here $\alpha_{\rm opt}$ has been used rather than $\sqrt{\bar{n}}$, as it should be closer to $\alpha^P$. This is the first feedback technique that I will discuss.

Figure 3.2: The mean values of $\varphi^2$ divided by $\sqrt{\varepsilon}$, plotted against $\alpha$, for measurements on coherent states with $\varepsilon \arg A_v$ feedback. The results for $\varepsilon = 1/2$ are shown as crosses, for $\varepsilon = 1/4$ as pluses, for $\varepsilon = 1/8$ as circles, for $\varepsilon = 1/16$ as squares, for $\varepsilon = 1/32$ as diamonds, for $\varepsilon = 1/64$ as triangles, for $\varepsilon = 1/128$ as asterisks, and for $\varepsilon = 1/256$ as stars. The analytic result of $1/(4\alpha)$ is shown as the continuous line.

Figure 3.3: The mean values of $|A|^2$ divided by $\varepsilon$, plotted against $\alpha$, for measurements on coherent states with $\arg(C_v^{1-\varepsilon} A_v^{\varepsilon})$ feedback. The results for $\varepsilon = 1/2$ are shown in dark blue, for $\varepsilon = 1/4$ in green, for $\varepsilon = 1/8$ in red, for $\varepsilon = 1/16$ in light blue, for $\varepsilon = 1/32$ in purple, for $\varepsilon = 1/64$ in yellow, for $\varepsilon = 1/128$ in black, for $\varepsilon = 1/256$ as a dashed dark blue line, and for $\varepsilon = 1/512$ as a dashed green line.



3.3 Simulation Method 

The easiest input states to use for numerical simulations are coherent states, as they remain coherent with a deterministically decaying amplitude. We simply have the equations (after the time variation of the amplitude is scaled out)

$$\begin{aligned}
I(v)dv &= 2\mathrm{Re}\left(\alpha e^{-i\Phi(v)}\right)dv + dW(v), \\
dA_v &= e^{i\Phi(v)} I(v)dv, \\
dB_v &= -e^{2i\Phi(v)}dv,
\end{aligned} \tag{3.50}$$

where $\Phi(v) = \varphi_v + \pi/2$, and $\varphi_v$ is obtained by the feedback technique that we are using. These equations can be integrated numerically by using a finite increment $\delta v$, and treating $dW(v)$ as a finite random variable with a normal distribution and variance $\delta v$.

However, in order to estimate the phase variance that is introduced by the measurement, coherent 
states would be very inefficient, as the phase variance would be dominated by the intrinsic phase 
variance. It is almost as easy (and much more efficient) to perform calculations on squeezed states, 
as squeezed states remain squeezed states under the stochastic evolution, and only the two squeezing 
parameters need be kept track of. The best squeezed states to use are those optimised for minimum 
intrinsic phase variance. For these states the total phase variance will be approximately twice the 
intrinsic phase variance when the measurements are close to optimal. 

In order to determine the stochastic evolution of the state, I will use a technique similar to that 
used in Ref. |Q . The master equation for detection of photons is 

$$d\rho = \left\{-i[H,\rho] + a\rho a^\dagger - \tfrac{1}{2}\left(a^\dagger a\rho + \rho a^\dagger a\right)\right\}dt. \tag{3.51}$$

For a consistent detection scheme the measurement operators must give this master equation. In general the master equation is given by

$$\rho(t+dt) = \sum_{n=0}^{N} \Omega_n \rho(t)\Omega_n^\dagger. \tag{3.52}$$

For simple detection the measurement operator is

$$\Omega_1 = \sqrt{dt}\,a. \tag{3.53}$$

In order for this to be consistent with the master equation, the measurement operator for no detection must be

$$\Omega_0 = 1 - \left(iH + \tfrac{1}{2}a^\dagger a\right)dt. \tag{3.54}$$

To consider measurements where the field is combined with a large amplitude coherent state that is treated classically, we can use a unitary rearrangement of the measurement operators. Using that given in [33] we obtain

$$\begin{aligned}
\Omega_1 &= \sqrt{dt}\,(a + \gamma), \\
\Omega_0 &= 1 - \left[iH + \tfrac{1}{2}(a^\dagger + \gamma^*)(a + \gamma) + \tfrac{1}{2}(\gamma^* a - \gamma a^\dagger)\right]dt.
\end{aligned} \tag{3.55}$$

The operator for no detection can be simplified to

$$\Omega_0 = 1 - \left(iH + \tfrac{1}{2}a^\dagger a + \gamma^* a + \tfrac{1}{2}|\gamma|^2\right)dt. \tag{3.56}$$
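As a numerical sanity check on Eqs. (3.55) and (3.56), the sketch below verifies that the rearranged measurement operators reproduce the master equation (3.51) to first order in $dt$. It is an illustration only: the Fock-space truncation `dim`, the test Hamiltonian, the value of $\gamma$, and all function names are assumptions made for this example.

```python
import numpy as np

def annihilation(dim):
    """Annihilation operator truncated to `dim` Fock states."""
    return np.diag(np.sqrt(np.arange(1, dim)), k=1).astype(complex)

def lindblad_rhs(rho, a, H):
    """Right-hand side of the master equation (3.51), per unit time."""
    ad = a.conj().T
    return (-1j * (H @ rho - rho @ H) + a @ rho @ ad
            - 0.5 * (ad @ a @ rho + rho @ ad @ a))

def measurement_step(rho, a, H, gamma, dt):
    """Sum over outcomes of Omega_n rho Omega_n^dagger, with Omega_1 and Omega_0
    taken from Eqs. (3.55) and (3.56)."""
    dim = a.shape[0]
    ad = a.conj().T
    eye = np.eye(dim, dtype=complex)
    omega1 = np.sqrt(dt) * (a + gamma * eye)
    omega0 = eye - (1j * H + 0.5 * ad @ a + np.conj(gamma) * a
                    + 0.5 * abs(gamma) ** 2 * eye) * dt
    return omega1 @ rho @ omega1.conj().T + omega0 @ rho @ omega0.conj().T

# Illustrative (hypothetical) parameters
dim, dt, gamma = 8, 1e-7, 3.0 * np.exp(0.4j)
a = annihilation(dim)
H = 0.3 * a.conj().T @ a
rng = np.random.default_rng(1)
psi = rng.normal(size=dim) + 1j * rng.normal(size=dim)
psi /= np.linalg.norm(psi)
rho = np.outer(psi, psi.conj())

# The difference should be of order dt, i.e. it vanishes as dt -> 0
residual = np.max(np.abs((measurement_step(rho, a, H, gamma, dt) - rho) / dt
                         - lindblad_rhs(rho, a, H)))
```

The terms involving $\gamma$ cancel between the two outcomes, which is why the local oscillator does not appear in the master equation.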
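When there is a detection the state changes to

$$|\psi(t+dt)\rangle = \frac{(a+\gamma)|\psi(t)\rangle}{\sqrt{\langle(a^\dagger+\gamma^*)(a+\gamma)\rangle}}. \tag{3.57}$$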
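Now the state after a detection can be multiplied by an arbitrary complex constant with magnitude 1 without altering the properties of the state. Therefore this state can be altered to

$$|\psi(t+dt)\rangle = \frac{(ae^{-i\Phi} + |\gamma|)|\psi(t)\rangle}{\sqrt{\langle(a^\dagger+\gamma^*)(a+\gamma)\rangle}}, \tag{3.58}$$

where $\Phi$ is the phase of the local oscillator. This means that the change in the state is

$$d|\psi(t)\rangle = \left[\frac{ae^{-i\Phi} + |\gamma|}{\sqrt{\langle(a^\dagger+\gamma^*)(a+\gamma)\rangle}} - 1\right]|\psi(t)\rangle. \tag{3.59}$$

When there is no detection the unnormalised state changes to

$$|\tilde\psi(t+dt)\rangle = \left[1 - \left(iH + \tfrac{1}{2}a^\dagger a + \gamma^* a + \tfrac{1}{2}|\gamma|^2\right)dt\right]|\psi(t)\rangle. \tag{3.60}$$

We find that

$$\begin{aligned}
\langle\tilde\psi(t+dt)|\tilde\psi(t+dt)\rangle &= \left\langle\left[1 - \left(-iH + \tfrac{1}{2}a^\dagger a + \gamma a^\dagger + \tfrac{1}{2}|\gamma|^2\right)dt\right]\left[1 - \left(iH + \tfrac{1}{2}a^\dagger a + \gamma^* a + \tfrac{1}{2}|\gamma|^2\right)dt\right]\right\rangle \\
&= \left\langle 1 - \left(a^\dagger a + \gamma^* a + \gamma a^\dagger + |\gamma|^2\right)dt\right\rangle \\
&= 1 - \left(\langle a^\dagger a\rangle + \gamma^*\langle a\rangle + \gamma\langle a^\dagger\rangle + |\gamma|^2\right)dt.
\end{aligned} \tag{3.61}$$

Therefore, to obtain a normalised state, we must multiply by a factor of

$$\langle\tilde\psi(t+dt)|\tilde\psi(t+dt)\rangle^{-1/2} = 1 + \tfrac{1}{2}\left(\langle a^\dagger a\rangle + \gamma^*\langle a\rangle + \gamma\langle a^\dagger\rangle + |\gamma|^2\right)dt. \tag{3.62}$$

This means that the normalised state changes to

$$|\psi(t+dt)\rangle = \left[1 + \left(\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} + \frac{\langle\gamma^* a + \gamma a^\dagger\rangle}{2} - \gamma^* a - iH\right)dt\right]|\psi(t)\rangle, \tag{3.63}$$

so the difference in the state is

$$d|\psi(t)\rangle = \left(\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} + \frac{\langle\gamma^* a + \gamma a^\dagger\rangle}{2} - \gamma^* a - iH\right)dt\,|\psi(t)\rangle. \tag{3.64}$$

The probability of a detection occurring in time $dt$ is

$$P_1 = dt\,\langle(a^\dagger+\gamma^*)(a+\gamma)\rangle. \tag{3.65}$$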



In order to determine a stochastic differential equation for the state, we define a stochastic increment $dN$ which takes the values 0 (for no detection) and 1 (for detection) and has the expectation value:

$$\mathrm{E}(dN) = dt\,\langle(a^\dagger+\gamma^*)(a+\gamma)\rangle. \tag{3.66}$$

Using this increment, the SDE for the state taking into account both alternatives is

$$d|\psi(t)\rangle = \left\{dN\left[\frac{ae^{-i\Phi} + |\gamma|}{\sqrt{\langle(a^\dagger+\gamma^*)(a+\gamma)\rangle}} - 1\right] + dt\left[\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} + \frac{\langle\gamma^* a + \gamma a^\dagger\rangle}{2} - \gamma^* a - iH\right]\right\}|\psi(t)\rangle. \tag{3.67}$$

When $dN$ takes the value 0, we simply have the increment for no detection (3.64). When $dN$ takes the value 1, we can ignore the term for no detection, as it is proportional to $dt$ and is infinitesimal. This means that we obtain the increment for a detection given by Eq. (3.59).
Note that Eq. (3.67) differs slightly from the result given in [33]. This is because the state for a detection was multiplied by $e^{-i\Phi}$ here. This is necessary in order to obtain a simple result in the limit of large $|\gamma|$. To take this limit we can approximate the increment $dN$ in terms of Wiener increments $dW$:

$$dN = \kappa\,dt + \sqrt{\kappa}\,dW, \tag{3.68}$$

where

$$\kappa = \langle(a^\dagger+\gamma^*)(a+\gamma)\rangle. \tag{3.69}$$

This approximation is used because it gives the correct expectation value (3.66), as well as $\langle dN\rangle = \langle dN^2\rangle$. Expanding the SDE for the state in terms of this we find



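$$\begin{aligned}
d|\psi(t)\rangle &= \left\{\left(\kappa\,dt + \sqrt{\kappa}\,dW\right)\left[\frac{ae^{-i\Phi} + |\gamma|}{\sqrt{\kappa}} - 1\right] + dt\left[\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} + \frac{\langle\gamma^* a + \gamma a^\dagger\rangle}{2} - \gamma^* a - iH\right]\right\}|\psi(t)\rangle \\
&= \left\{-\kappa\,dt - \sqrt{\kappa}\,dW + \sqrt{\kappa}\left(ae^{-i\Phi} + |\gamma|\right)dt + \left(ae^{-i\Phi} + |\gamma|\right)dW\right. \\
&\qquad\left. + dt\left[\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} + \frac{\langle\gamma^* a + \gamma a^\dagger\rangle}{2} - \gamma^* a - iH\right]\right\}|\psi(t)\rangle.
\end{aligned} \tag{3.70}$$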



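Expanding $\kappa$ gives

$$\begin{aligned}
\kappa &= \langle(a^\dagger+\gamma^*)(a+\gamma)\rangle \\
&= \langle a^\dagger a + a^\dagger\gamma + \gamma^* a + |\gamma|^2\rangle \\
&= |\gamma|^2\left(1 + \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}\right),
\end{aligned} \tag{3.71}$$

where

$$\chi = \tfrac{1}{2}\left\langle ae^{-i\Phi} + a^\dagger e^{i\Phi}\right\rangle. \tag{3.72}$$

We can expand $\sqrt{\kappa}$ as

$$\begin{aligned}
\sqrt{\kappa} &= |\gamma|\sqrt{1 + \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}} \\
&\approx |\gamma|\left(1 + \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right).
\end{aligned} \tag{3.73}$$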
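The accuracy of the expansion (3.73) can be checked numerically for scalar values. The sketch below is illustrative only; the values chosen for $\chi$ and $\langle a^\dagger a\rangle$ are arbitrary, and the function names are hypothetical. The neglected terms in $\sqrt{\kappa}$ are of order $|\gamma|^{-2}$, so the error should fall by roughly a factor of 100 for each factor of 10 in $|\gamma|$.

```python
import numpy as np

def kappa(gamma_abs, chi, n_mean):
    """kappa as in Eq. (3.71), for scalar chi and mean photon number n_mean."""
    return gamma_abs**2 * (1 + 2 * chi / gamma_abs + n_mean / gamma_abs**2)

def sqrt_kappa_expansion(gamma_abs, chi, n_mean):
    """Large-|gamma| expansion of sqrt(kappa), Eq. (3.73)."""
    return gamma_abs * (1 + chi / gamma_abs + n_mean / (2 * gamma_abs**2)
                        - chi**2 / (2 * gamma_abs**2))

chi, n_mean = 1.3, 4.0              # arbitrary illustrative values
gammas = [1e1, 1e2, 1e3]
errors = [abs(np.sqrt(kappa(g, chi, n_mean)) - sqrt_kappa_expansion(g, chi, n_mean))
          for g in gammas]
```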



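Now using this in the expression for the SDE

$$\begin{aligned}
d|\psi(t)\rangle &= \left\{\left[ae^{-i\Phi} + |\gamma| - |\gamma|\left(1 + \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right)\right]dW\right. \\
&\qquad + \left[-|\gamma|^2\left(1 + \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}\right) + |\gamma|^2\left(1 + \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right)\left(1 + \frac{ae^{-i\Phi}}{|\gamma|}\right)\right. \\
&\qquad\left.\left. + \frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} + |\gamma|\chi - |\gamma|ae^{-i\Phi} - iH\right]dt\right\}|\psi(t)\rangle \\
&\approx \left[\left(ae^{-i\Phi} - \chi\right)dW - \left(iH + \tfrac{1}{2}a^\dagger a - ae^{-i\Phi}\chi + \tfrac{1}{2}\chi^2\right)dt\right]|\psi(t)\rangle,
\end{aligned} \tag{3.74}$$

where terms that vanish in the limit of large $|\gamma|$ have been omitted in the last line.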
These results are for the case where the field is directly combined with a local oscillator field, and there is a single photodetector. This was done for simplicity and for consistency with Ref. [33]. The particular experimental configuration that we would like to consider, however, is that shown in Fig. 1.2. Here the field is combined with the local oscillator field via a 50/50 beam splitter, and there are two photodetectors at the two outputs of the beam splitter. This case turns out to be entirely equivalent, as I will now show.

In order to consider this case we can't perform a simple unitary rearrangement of the measurement operators because we need 3 operators, for detection at one arm, detection at the other arm, and no detection. As a simple extrapolation from the previous case we can take the two measurement operators for detection to be

$$\Omega_\pm = \sqrt{\frac{dt}{2}}\,(a \pm \gamma). \tag{3.75}$$

In order for this measurement scheme to give the correct master equation, the measurement operator for no detection must be

$$\Omega_0 = 1 - \left(iH + \tfrac{1}{2}a^\dagger a + \tfrac{1}{2}|\gamma|^2\right)dt. \tag{3.76}$$

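The above results for a single photodetector can be simply extended to two photodetectors. The change in state for a detection at photodetector 1 is

$$d_1|\psi(t)\rangle = \left[\frac{|\gamma| + ae^{-i\Phi}}{\sqrt{\langle(a^\dagger+\gamma^*)(a+\gamma)\rangle}} - 1\right]|\psi(t)\rangle, \tag{3.77}$$

and at photodetector 2 is

$$d_2|\psi(t)\rangle = \left[\frac{|\gamma| - ae^{-i\Phi}}{\sqrt{\langle(a^\dagger-\gamma^*)(a-\gamma)\rangle}} - 1\right]|\psi(t)\rangle. \tag{3.78}$$

When there is no detection the unnormalised state changes to

$$|\tilde\psi(t+dt)\rangle = \left[1 - \left(iH + \tfrac{1}{2}a^\dagger a + \tfrac{1}{2}|\gamma|^2\right)dt\right]|\psi(t)\rangle. \tag{3.79}$$

We find that

$$\begin{aligned}
\langle\tilde\psi(t+dt)|\tilde\psi(t+dt)\rangle &= \left\langle\left[1 - \left(-iH + \tfrac{1}{2}a^\dagger a + \tfrac{1}{2}|\gamma|^2\right)dt\right]\left[1 - \left(iH + \tfrac{1}{2}a^\dagger a + \tfrac{1}{2}|\gamma|^2\right)dt\right]\right\rangle \\
&= \left\langle 1 - \left(a^\dagger a + |\gamma|^2\right)dt\right\rangle \\
&= 1 - \left(\langle a^\dagger a\rangle + |\gamma|^2\right)dt,
\end{aligned} \tag{3.80}$$

so the normalising factor is

$$\langle\tilde\psi(t+dt)|\tilde\psi(t+dt)\rangle^{-1/2} = 1 + \tfrac{1}{2}\left(\langle a^\dagger a\rangle + |\gamma|^2\right)dt. \tag{3.81}$$

This means that the normalised state changes to

$$|\psi(t+dt)\rangle = \left[1 + \left(\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} - iH\right)dt\right]|\psi(t)\rangle, \tag{3.82}$$

so the difference in the state is

$$d|\psi(t)\rangle = dt\left(\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} - iH\right)|\psi(t)\rangle. \tag{3.83}$$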






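Then the increment in the state taking into account all three alternatives is

$$\begin{aligned}
d|\psi(t)\rangle &= \left\{dN_+\left[\frac{|\gamma| + ae^{-i\Phi}}{\sqrt{\langle(a^\dagger+\gamma^*)(a+\gamma)\rangle}} - 1\right] + dN_-\left[\frac{|\gamma| - ae^{-i\Phi}}{\sqrt{\langle(a^\dagger-\gamma^*)(a-\gamma)\rangle}} - 1\right]\right. \\
&\qquad\left. + dt\left[\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} - iH\right]\right\}|\psi(t)\rangle.
\end{aligned} \tag{3.84}$$

Now rather than defining a single $\kappa$, we can define

$$\kappa_\pm = \langle(a^\dagger \pm \gamma^*)(a \pm \gamma)\rangle. \tag{3.85}$$

In terms of this the increment is

$$d|\psi(t)\rangle = \left\{dN_+\left[\frac{|\gamma| + ae^{-i\Phi}}{\sqrt{\kappa_+}} - 1\right] + dN_-\left[\frac{|\gamma| - ae^{-i\Phi}}{\sqrt{\kappa_-}} - 1\right] + dt\left[\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} - iH\right]\right\}|\psi(t)\rangle. \tag{3.86}$$

For large $|\gamma|$ we can use the approximation

$$dN_\pm = \frac{\kappa_\pm}{2}\,dt + \sqrt{\frac{\kappa_\pm}{2}}\,dW_\pm. \tag{3.87}$$

Substituting this in gives

$$\begin{aligned}
d|\psi(t)\rangle &= \left\{\frac{\sqrt{\kappa_+}}{2}\left(|\gamma| + ae^{-i\Phi}\right)dt + \frac{1}{\sqrt{2}}\left(|\gamma| + ae^{-i\Phi}\right)dW_+ - \frac{\kappa_+}{2}\,dt - \sqrt{\frac{\kappa_+}{2}}\,dW_+\right. \\
&\qquad + \frac{\sqrt{\kappa_-}}{2}\left(|\gamma| - ae^{-i\Phi}\right)dt + \frac{1}{\sqrt{2}}\left(|\gamma| - ae^{-i\Phi}\right)dW_- - \frac{\kappa_-}{2}\,dt - \sqrt{\frac{\kappa_-}{2}}\,dW_- \\
&\qquad\left. + dt\left[\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} - iH\right]\right\}|\psi(t)\rangle.
\end{aligned} \tag{3.88}$$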

Similarly to the previous case we have

$$\kappa_\pm = |\gamma|^2\left(1 \pm \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}\right), \tag{3.89}$$

and

$$\sqrt{\kappa_\pm} \approx |\gamma|\left(1 \pm \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right). \tag{3.90}$$

Substituting this in gives

$$\begin{aligned}
d|\psi(t)\rangle &\approx \left\{\frac{|\gamma|^2}{2}\left(1 + \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right)\left(1 + \frac{ae^{-i\Phi}}{|\gamma|}\right)dt - \frac{|\gamma|^2}{2}\left(1 + \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}\right)dt\right. \\
&\qquad + \frac{|\gamma|^2}{2}\left(1 - \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right)\left(1 - \frac{ae^{-i\Phi}}{|\gamma|}\right)dt - \frac{|\gamma|^2}{2}\left(1 - \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}\right)dt \\
&\qquad\left. + \frac{|\gamma|}{\sqrt{2}}\left(\frac{ae^{-i\Phi}}{|\gamma|} - \frac{\chi}{|\gamma|}\right)\left(dW_+ - dW_-\right) + dt\left[\frac{\langle a^\dagger a\rangle}{2} - \frac{a^\dagger a}{2} - iH\right]\right\}|\psi(t)\rangle \\
&= \left[\frac{1}{\sqrt{2}}\left(ae^{-i\Phi} - \chi\right)\left(dW_+ - dW_-\right) - \left(iH + \tfrac{1}{2}a^\dagger a - a\chi e^{-i\Phi} + \tfrac{1}{2}\chi^2\right)dt\right]|\psi(t)\rangle.
\end{aligned} \tag{3.91}$$

Now we find that the combination of the two stochastic increments is equivalent to a third

$$dW = \frac{dW_+ - dW_-}{\sqrt{2}}. \tag{3.92}$$

In terms of this we find

$$d|\psi(t)\rangle = \left[\left(ae^{-i\Phi} - \chi\right)dW - \left(iH + \tfrac{1}{2}a^\dagger a - a\chi e^{-i\Phi} + \tfrac{1}{2}\chi^2\right)dt\right]|\psi(t)\rangle. \tag{3.93}$$

This case is therefore exactly the same as the case where the fields are combined directly and there is only one photodetector.

Note also that the POM derived in Ref. [33] was derived for the case where the signal is combined directly with the local oscillator, and there is one photodetector. It turns out that this POM also applies to the case with a 50/50 beam splitter and two photodetectors. This can be shown by performing the derivation in a similar way to Ref. [33]. In general, we can consider the stochastic evolution of an unnormalised state vector

$$|\tilde\psi_r(t+T)\rangle = \frac{\Omega_r(T)|\psi(t)\rangle}{\sqrt{\Lambda_r(T)}}, \tag{3.94}$$

where $\Lambda_r(T)$ is the ostensible probability for the result $r$, and the actual probability is given by

$$P_r(T) = \Lambda_r(T)\,\langle\tilde\psi_r(t+T)|\tilde\psi_r(t+T)\rangle. \tag{3.95}$$

We can use this strategy on the above measurements, with the ostensible probabilities given by

$$\Lambda_\pm(dt) = \tfrac{1}{2}|\gamma|^2 dt, \tag{3.96}$$

$$\Lambda_0(dt) = 1 - |\gamma|^2 dt. \tag{3.97}$$
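In this case the increments for the two detection cases are

$$d|\tilde\psi_\pm(t)\rangle = \pm\frac{ae^{-i\Phi}}{|\gamma|}|\tilde\psi(t)\rangle, \tag{3.98}$$

and the increment for the case where there is no detection is

$$d|\tilde\psi_0(t)\rangle = -dt\left(\tfrac{1}{2}a^\dagger a + iH\right)|\tilde\psi(t)\rangle. \tag{3.99}$$

Writing out the evolution explicitly

$$d|\tilde\psi(t)\rangle = \left[dN_+(t)\frac{ae^{-i\Phi}}{|\gamma|} - dN_-(t)\frac{ae^{-i\Phi}}{|\gamma|} - dt\left(iH + \tfrac{1}{2}a^\dagger a\right)\right]|\tilde\psi(t)\rangle. \tag{3.100}$$

Now we can approximate the Poisson process by a Gaussian process:

$$dN_\pm = \tfrac{1}{2}|\gamma|^2 dt + \frac{|\gamma|}{\sqrt{2}}\,dW_\pm(t). \tag{3.101}$$

Using this gives

$$\begin{aligned}
d|\tilde\psi(t)\rangle &= \left[\frac{1}{\sqrt{2}}dW_+(t)\,ae^{-i\Phi} - \frac{1}{\sqrt{2}}dW_-(t)\,ae^{-i\Phi} - dt\left(iH + \tfrac{1}{2}a^\dagger a\right)\right]|\tilde\psi(t)\rangle \\
&= \left[\frac{1}{\sqrt{2}}ae^{-i\Phi}\left(dW_+(t) - dW_-(t)\right) - dt\left(iH + \tfrac{1}{2}a^\dagger a\right)\right]|\tilde\psi(t)\rangle.
\end{aligned} \tag{3.102}$$

Now replacing $(dW_+(t) - dW_-(t))/\sqrt{2}$ with $dW(t)$ as before, we obtain

$$d|\tilde\psi(t)\rangle = \left[ae^{-i\Phi}dW(t) - dt\left(iH + \tfrac{1}{2}a^\dagger a\right)\right]|\tilde\psi(t)\rangle. \tag{3.103}$$

Then the recorded photocurrent is given by

$$\begin{aligned}
I(t) &= \lim_{\delta t\to 0}\,\lim_{|\gamma|\to\infty}\frac{\delta N_+ - \delta N_-}{|\gamma|\delta t} \\
&= \lim_{|\gamma|\to\infty}\frac{\frac{|\gamma|}{\sqrt{2}}dW_+(t) - \frac{|\gamma|}{\sqrt{2}}dW_-(t)}{|\gamma|\,dt} \\
&= \frac{\frac{1}{\sqrt{2}}\left(dW_+(t) - dW_-(t)\right)}{dt} \\
&= \frac{dW(t)}{dt}.
\end{aligned} \tag{3.104}$$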



Note that, although the limits are noncommuting when performed on the increments $\delta N_\pm$, they are commuting for Gaussian increments. It is therefore reasonable to perform the time limit first, as in the second line above.

From this point forward the derivation is identical to that given in Ref. [33], and I will therefore not give it explicitly. Note that the above expression for $I(t)$ is obtained using the same definition as when we are considering the actual probabilities, and not the ostensible probabilities. The increments $dN_\pm$ are chosen with the ostensible probabilities here, resulting in an expression for $I(t)$ that only involves the stochastic part.
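The Gaussian approximation to the two Poisson count processes, and the combination of $dW_+$ and $dW_-$ into a single Wiener increment, can be illustrated numerically. The sketch below draws counts with the ostensible rates $\tfrac{1}{2}|\gamma|^2\delta t$ and checks that the scaled difference $(\delta N_+ - \delta N_-)/|\gamma|$ has variance close to $\delta t$, as a Wiener increment would. The values of $|\gamma|$, $\delta t$ and the sample size are arbitrary choices for the example.

```python
import numpy as np

rng = np.random.default_rng(2)
gamma_abs = 100.0      # local oscillator amplitude (illustrative value)
dt = 1e-2              # finite time step (illustrative value)
n_samples = 100_000

# Poisson counts at the two detectors, using the ostensible rate |gamma|^2 dt / 2
rate = 0.5 * gamma_abs**2 * dt
dN_plus = rng.poisson(rate, n_samples)
dN_minus = rng.poisson(rate, n_samples)
scaled_difference = (dN_plus - dN_minus) / gamma_abs

# Gaussian version: dW = (dW_+ - dW_-) / sqrt(2)
dW_plus = rng.normal(0.0, np.sqrt(dt), n_samples)
dW_minus = rng.normal(0.0, np.sqrt(dt), n_samples)
dW = (dW_plus - dW_minus) / np.sqrt(2.0)

# Both ratios should be close to 1
poisson_var_ratio = scaled_difference.var() / dt
gaussian_var_ratio = dW.var() / dt
```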

Therefore we find that, similarly to the case for a single detector, the POM for the parameters $A$ and $B$ defined by

$$A = \int_0^\infty e^{i\Phi}e^{-s/2}I(s)\,ds, \qquad B = -\int_0^\infty e^{2i\Phi}e^{-s}\,ds, \tag{3.105}$$

is

$$F(A,B) = Q(A,B)\,|\tilde\psi(A,B)\rangle\langle\tilde\psi(A,B)|, \tag{3.106}$$

where

$$|\tilde\psi(A,B)\rangle = \exp\left(\tfrac{1}{2}B(a^\dagger)^2 + Aa^\dagger\right)|0\rangle, \tag{3.107}$$

and $Q(A,B)$ is the ostensible probability distribution. This derivation is for the case $H = 0$, corresponding to a freely damped cavity. From this point forward I will take $H = 0$, rather than continuing to include this term.

Now we can determine the stochastic evolution of a squeezed state. To do this I use the method 
of Rigo et al. J5^] . Squeezed states obey the relation 

$$\left(a - B_t^S a^\dagger - A_t^S\right)|A_t^S, B_t^S\rangle = 0. \tag{3.108}$$

Here the squeezing parameters $A_t^S$ and $B_t^S$ are defined so that they are analogous to the parameters $A$ and $B$. They are related to the usual squeezing parameters by

$$B_t^S = -\frac{\zeta_t\tanh|\zeta_t|}{|\zeta_t|}, \qquad A_t^S = \alpha_t - B_t^S\alpha_t^*. \tag{3.109}$$

This is the same as the relation between $A$ and $B$ and the parameters $\alpha^P$ and $\zeta^P$ for the squeezed state in the probability distribution.
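The relation (3.109) and its inverse for the amplitude, $\alpha_t = (A_t^S + B_t^S A_t^{S*})/(1 - |B_t^S|^2)$ (which appears as Eq. (3.120)), are easy to encode. The sketch below is a minimal illustration with hypothetical function names and arbitrary test values; it simply checks that the two relations are consistent with each other.

```python
import numpy as np

def squeezed_to_AB(alpha, zeta):
    """Map the usual squeezing parameters (alpha, zeta) to (A^S, B^S), Eq. (3.109)."""
    abs_zeta = abs(zeta)
    # For zeta = 0 the state is coherent and B^S = 0
    B = 0j if abs_zeta == 0 else -zeta * np.tanh(abs_zeta) / abs_zeta
    A = alpha - B * np.conj(alpha)
    return A, B

def AB_to_alpha(A, B):
    """Recover the coherent amplitude from (A^S, B^S), Eq. (3.120)."""
    return (A + B * np.conj(A)) / (1 - abs(B) ** 2)

# Arbitrary illustrative values
alpha, zeta = 2.0 + 1.0j, 0.7 * np.exp(0.3j)
A, B = squeezed_to_AB(alpha, zeta)
alpha_back = AB_to_alpha(A, B)   # should reproduce alpha
```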

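If the squeezed state remains a squeezed state under the increment $d|\psi\rangle$, then it can be shown from Eq. (3.108) that

$$\left(a - B_t^S a^\dagger - A_t^S\right)d|\psi\rangle = \left(dB_t^S a^\dagger + dA_t^S\right)|\psi\rangle, \tag{3.110}$$

where the Stratonovich formalism has been used. If we convert the stochastic increment for the state into the Stratonovich form and substitute it into the above equation, then we can obtain an SDE for the squeezing parameters. To convert from the Ito to the Stratonovich form we make the replacement

$$X\,dY \to X\,dY - \tfrac{1}{2}dX\,dY. \tag{3.111}$$

Using this on the above SDE gives the Stratonovich increment

$$\begin{aligned}
d|\psi(t)\rangle &= \left[\left(ae^{-i\Phi} - \chi\right)dW - \left(\tfrac{1}{2}a^\dagger a - a\chi e^{-i\Phi} + \tfrac{1}{2}\chi^2\right)dt - \tfrac{1}{2}\left(a\,d(e^{-i\Phi}) - d\chi\right)dW\right]|\psi(t)\rangle - \tfrac{1}{2}\left(ae^{-i\Phi} - \chi\right)dW\,d|\psi(t)\rangle \\
&= \left[\left(ae^{-i\Phi} - \chi\right)dW - \left(\tfrac{1}{2}a^\dagger a - a\chi e^{-i\Phi} + \tfrac{1}{2}\chi^2\right)dt - \tfrac{1}{2}\left(a\,d(e^{-i\Phi}) - d\chi\right)dW\right]|\psi(t)\rangle - \tfrac{1}{2}\left(ae^{-i\Phi} - \chi\right)^2 dt\,|\psi(t)\rangle \\
&= \left[\left(ae^{-i\Phi} - \chi\right)dW - \left(\tfrac{1}{2}a^\dagger a + \tfrac{1}{2}a^2e^{-2i\Phi} - 2a\chi e^{-i\Phi} + \chi^2\right)dt - \tfrac{1}{2}\left(a\,d(e^{-i\Phi}) - d\chi\right)dW\right]|\psi(t)\rangle.
\end{aligned} \tag{3.112}$$

Here the increments $d(e^{-i\Phi})$ and $d\chi$ have been included because the phase of the local oscillator can vary stochastically.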

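In order to determine the SDE for the squeezing parameters, it is convenient to firstly determine the effect of $(a - B_t^S a^\dagger - A_t^S)$ on each of the operators in this equation.

$$\begin{aligned}
\left(a - B_t^S a^\dagger - A_t^S\right)C|A_t^S, B_t^S\rangle &= C\left(a - B_t^S a^\dagger - A_t^S\right)|A_t^S, B_t^S\rangle \\
&= 0
\end{aligned} \tag{3.113}$$

$$\begin{aligned}
\left(a - B_t^S a^\dagger - A_t^S\right)a|A_t^S, B_t^S\rangle &= \left[a\left(a - B_t^S a^\dagger - A_t^S\right) + B_t^S\right]|A_t^S, B_t^S\rangle \\
&= B_t^S|A_t^S, B_t^S\rangle
\end{aligned} \tag{3.114}$$

$$\begin{aligned}
\left(a - B_t^S a^\dagger - A_t^S\right)a^2|A_t^S, B_t^S\rangle &= \left[a\left(a - B_t^S a^\dagger - A_t^S\right) + B_t^S\right]a|A_t^S, B_t^S\rangle \\
&= \left\{a\left[a\left(a - B_t^S a^\dagger - A_t^S\right) + B_t^S\right] + aB_t^S\right\}|A_t^S, B_t^S\rangle \\
&= 2aB_t^S|A_t^S, B_t^S\rangle \\
&= 2B_t^S\left(B_t^S a^\dagger + A_t^S\right)|A_t^S, B_t^S\rangle
\end{aligned} \tag{3.115}$$

$$\begin{aligned}
\left(a - B_t^S a^\dagger - A_t^S\right)a^\dagger a|A_t^S, B_t^S\rangle &= \left[a^\dagger\left(a - B_t^S a^\dagger - A_t^S\right) + 1\right]a|A_t^S, B_t^S\rangle \\
&= \left\{a^\dagger\left[a\left(a - B_t^S a^\dagger - A_t^S\right) + B_t^S\right] + a\right\}|A_t^S, B_t^S\rangle \\
&= \left(a^\dagger B_t^S + a\right)|A_t^S, B_t^S\rangle \\
&= \left(2a^\dagger B_t^S + A_t^S\right)|A_t^S, B_t^S\rangle
\end{aligned} \tag{3.116}$$

Here $C$ is any scalar constant. Using these results it is straightforward to show that

$$\begin{aligned}
\left(a - B_t^S a^\dagger - A_t^S\right)d|\psi(t)\rangle &= \left\{\left[-\tfrac{1}{2}\left(2a^\dagger B_t^S + A_t^S\right) - B_t^S\left(B_t^S a^\dagger + A_t^S\right)e^{-2i\Phi} + 2B_t^S\chi e^{-i\Phi}\right]dt\right. \\
&\qquad\left. + B_t^S e^{-i\Phi}dW - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW\right\}|\psi(t)\rangle \\
&= \left(dB_t^S a^\dagger + dA_t^S\right)|\psi(t)\rangle.
\end{aligned} \tag{3.117}$$

The Stratonovich SDEs for the squeezing parameters are therefore

$$\begin{aligned}
dB_t^S &= \left(-B_t^S - (B_t^S)^2 e^{-2i\Phi}\right)dt \\
&= -B_t^S\left(1 + e^{-2i\Phi}B_t^S\right)dt
\end{aligned} \tag{3.118}$$

$$\begin{aligned}
dA_t^S &= \left(-\tfrac{1}{2}A_t^S - B_t^S A_t^S e^{-2i\Phi} + 2B_t^S\chi e^{-i\Phi}\right)dt + B_t^S e^{-i\Phi}dW - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW \\
&= \left[-\tfrac{1}{2}A_t^S - B_t^S A_t^S e^{-2i\Phi} + B_t^S\left(\langle a\rangle e^{-2i\Phi} + \langle a^\dagger\rangle\right)\right]dt + B_t^S e^{-i\Phi}dW - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW \\
&= \left\{-\tfrac{1}{2}A_t^S - B_t^S A_t^S e^{-2i\Phi} + B_t^S\left[\left(B_t^S\langle a^\dagger\rangle + A_t^S\right)e^{-2i\Phi} + \langle a^\dagger\rangle\right]\right\}dt + B_t^S e^{-i\Phi}dW - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW \\
&= -\tfrac{1}{2}A_t^S dt + B_t^S\langle a^\dagger\rangle\left(1 + e^{-2i\Phi}B_t^S\right)dt + B_t^S e^{-i\Phi}dW - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW.
\end{aligned} \tag{3.119}$$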
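In order to convert back to the SDE for the standard (non-scaled) amplitude, note that

$$\alpha_t = \frac{A_t^S + B_t^S A_t^{S*}}{1 - |B_t^S|^2}. \tag{3.120}$$

Using this expression, the Stratonovich increment in $\alpha_t$ is

$$\begin{aligned}
d\alpha_t &= \frac{dA_t^S + B_t^S dA_t^{S*} + A_t^{S*}dB_t^S}{1 - |B_t^S|^2} + \left(B_t^{S*}dB_t^S + B_t^S dB_t^{S*}\right)\frac{A_t^S + B_t^S A_t^{S*}}{\left(1 - |B_t^S|^2\right)^2} \\
&= \frac{dA_t^S + B_t^S dA_t^{S*} + A_t^{S*}dB_t^S + \alpha_t\left(B_t^{S*}dB_t^S + B_t^S dB_t^{S*}\right)}{1 - |B_t^S|^2}.
\end{aligned} \tag{3.121}$$

Expanding this gives

$$\begin{aligned}
d\alpha_t &= \Big\{-\tfrac{1}{2}A_t^S dt + B_t^S\langle a^\dagger\rangle\left(1 + e^{-2i\Phi}B_t^S\right)dt + B_t^S e^{-i\Phi}dW - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW \\
&\qquad + B_t^S\left[-\tfrac{1}{2}A_t^{S*}dt + B_t^{S*}\langle a\rangle\left(1 + e^{2i\Phi}B_t^{S*}\right)dt + B_t^{S*}e^{i\Phi}dW - \tfrac{1}{2}B_t^{S*}d(e^{i\Phi})dW\right] \\
&\qquad - A_t^{S*}B_t^S\left(1 + e^{-2i\Phi}B_t^S\right)dt - \alpha_t\left[B_t^{S*}B_t^S\left(1 + e^{-2i\Phi}B_t^S\right)dt + B_t^S B_t^{S*}\left(1 + e^{2i\Phi}B_t^{S*}\right)dt\right]\Big\}\frac{1}{1 - |B_t^S|^2} \\
&= -\tfrac{1}{2}\alpha_t dt + \Big\{B_t^S\alpha_t^*\left(1 + e^{-2i\Phi}B_t^S\right)dt + B_t^S e^{-i\Phi}dW - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW \\
&\qquad + B_t^S\left[B_t^{S*}\alpha_t\left(1 + e^{2i\Phi}B_t^{S*}\right)dt + B_t^{S*}e^{i\Phi}dW - \tfrac{1}{2}B_t^{S*}d(e^{i\Phi})dW\right] - A_t^{S*}B_t^S\left(1 + e^{-2i\Phi}B_t^S\right)dt \\
&\qquad - \alpha_t\left[B_t^{S*}B_t^S\left(1 + e^{-2i\Phi}B_t^S\right)dt + B_t^S B_t^{S*}\left(1 + e^{2i\Phi}B_t^{S*}\right)dt\right]\Big\}\frac{1}{1 - |B_t^S|^2} \\
&= -\tfrac{1}{2}\alpha_t dt + \Big\{B_t^S\left[\alpha_t^* - B_t^{S*}\alpha_t\right]\left(1 + e^{-2i\Phi}B_t^S\right)dt + B_t^S e^{-i\Phi}dW + B_t^S B_t^{S*}e^{i\Phi}dW \\
&\qquad - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW - \tfrac{1}{2}B_t^S B_t^{S*}d(e^{i\Phi})dW - A_t^{S*}B_t^S\left(1 + e^{-2i\Phi}B_t^S\right)dt\Big\}\frac{1}{1 - |B_t^S|^2} \\
&= -\tfrac{1}{2}\alpha_t dt + \Big\{B_t^S A_t^{S*}\left(1 + e^{-2i\Phi}B_t^S\right)dt + B_t^S e^{-i\Phi}dW + B_t^S B_t^{S*}e^{i\Phi}dW - \tfrac{1}{2}B_t^S d(e^{-i\Phi})dW \\
&\qquad - \tfrac{1}{2}B_t^S B_t^{S*}d(e^{i\Phi})dW - A_t^{S*}B_t^S\left(1 + e^{-2i\Phi}B_t^S\right)dt\Big\}\frac{1}{1 - |B_t^S|^2} \\
&= -\tfrac{1}{2}\alpha_t dt + \frac{B_t^S dW}{1 - |B_t^S|^2}\left(B_t^{S*}e^{i\Phi} + e^{-i\Phi}\right) - \frac{B_t^S dW}{2\left(1 - |B_t^S|^2\right)}\left(B_t^{S*}d(e^{i\Phi}) + d(e^{-i\Phi})\right).
\end{aligned} \tag{3.122}$$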
Converting back to the Ito form of the SDE gives

$$d\alpha_t = -\tfrac{1}{2}\alpha_t dt + \frac{B_t^S dW\left(B_t^{S*}e^{i\Phi} + e^{-i\Phi}\right)}{1 - |B_t^S|^2}. \tag{3.123}$$

The SDE for $B_t^S$ is unchanged. Note that in converting back to the Ito form, the extra term involving the increment in the phase of the local oscillator cancels out.
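Now for consistency with Ref. [33] I will take the signal to be

$$I(t) = \lim_{\delta t\to 0}\,\lim_{|\gamma|\to\infty}\frac{\delta N_+ - \delta N_-}{|\gamma|\delta t}. \tag{3.124}$$

In order to evaluate this, recall that the expression for the photon number counts at the two photodetectors is

$$dN_\pm = \frac{\kappa_\pm}{2}\,dt + \sqrt{\frac{\kappa_\pm}{2}}\,dW_\pm, \tag{3.125}$$

where

$$\kappa_\pm = |\gamma|^2\left(1 \pm \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}\right), \tag{3.126}$$

and the expansion of the square root is

$$\sqrt{\kappa_\pm} \approx |\gamma|\left(1 \pm \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right). \tag{3.127}$$

Using these expressions gives the photocurrent as

$$\begin{aligned}
I(t)dt &= \lim_{|\gamma|\to\infty}\frac{1}{|\gamma|}\left[\frac{|\gamma|^2}{2}\left(1 + \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}\right)dt + \frac{|\gamma|}{\sqrt{2}}\left(1 + \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right)dW_+\right. \\
&\qquad\left. - \frac{|\gamma|^2}{2}\left(1 - \frac{2\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{|\gamma|^2}\right)dt - \frac{|\gamma|}{\sqrt{2}}\left(1 - \frac{\chi}{|\gamma|} + \frac{\langle a^\dagger a\rangle}{2|\gamma|^2} - \frac{\chi^2}{2|\gamma|^2}\right)dW_-\right] \\
&= \lim_{|\gamma|\to\infty}\frac{1}{|\gamma|}\left[\frac{|\gamma|^2}{2}\,\frac{4\chi}{|\gamma|}\,dt + \frac{|\gamma|}{\sqrt{2}}\,dW_+ - \frac{|\gamma|}{\sqrt{2}}\,dW_-\right] \\
&= 2\chi\,dt + \frac{1}{\sqrt{2}}\left(dW_+ - dW_-\right) \\
&= 2\chi\,dt + dW \\
&= 2\mathrm{Re}\left(\langle a\rangle e^{-i\Phi}\right)dt + dW.
\end{aligned} \tag{3.128}$$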



This result is entirely general and does not depend on the state being coherent or squeezed. In the case of a squeezed state we have

$$\langle a\rangle = \alpha_t. \tag{3.129}$$

Now note that the deterministic part of the SDE for $\alpha_t$ is

$$\langle d\alpha_t\rangle = -\tfrac{1}{2}\alpha_t dt. \tag{3.130}$$

This means that the deterministic part of the evolution is

$$\alpha_t \propto e^{-t/2}. \tag{3.131}$$

In addition, the parameters $A_t$ and $B_t$ are defined in Ref. [33] by

$$A_t = \int_0^t e^{i\Phi}e^{-s/2}I(s)\,ds, \tag{3.132}$$

$$B_t = -\int_0^t e^{2i\Phi}e^{-s}\,ds. \tag{3.133}$$
The exponential factors can be removed by changing the time variable to

$$v = 1 - e^{-t}, \tag{3.134}$$

so that

$$\begin{aligned}
dv &= e^{-t}dt \\
&= (1 - v)\,dt.
\end{aligned} \tag{3.135}$$

This transformation maps the time to the unit interval $[0, 1)$. It is also convenient to define a new scaled coherent amplitude to remove the systematic variation:

$$\alpha_v = \alpha_t e^{t/2}. \tag{3.136}$$

Here the $v$ subscript indicates the scaled amplitude, and the $t$ subscript indicates the original, unscaled amplitude. Since these are equal to each other at zero time, there is no ambiguity in the initial amplitude $\alpha$. The definition of $I(t)$ must also be altered to remove the exponential factors:

$$I(t) = \lim_{\delta t\to 0}\,\lim_{|\gamma|\to\infty}\frac{\delta N_+ - \delta N_-}{|\gamma|e^{-t/2}\delta t}. \tag{3.137}$$

Using this gives the photocurrent as

$$I(v)dv = 2\mathrm{Re}\left(\alpha_v e^{-i\Phi}\right)dv + dW(v). \tag{3.138}$$

With these changes of variables, the definitions for $A_v$ and $B_v$ become

$$A_v = \int_0^v e^{i\Phi}I(u)\,du, \tag{3.139}$$

$$B_v = -\int_0^v e^{2i\Phi}\,du. \tag{3.140}$$

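The SDE for the scaled amplitude $\alpha_v$ is

$$\begin{aligned}
d\alpha_v &= d\left(\alpha_t e^{t/2}\right) \\
&= \left[-\tfrac{1}{2}\alpha_t dt + \frac{B_t^S dW(t)\left(B_t^{S*}e^{i\Phi} + e^{-i\Phi}\right)}{1 - |B_t^S|^2}\right]e^{t/2} + \tfrac{1}{2}\alpha_t e^{t/2}dt \\
&= \frac{1}{1 - v}\,\frac{B_v^S dW(v)\left(B_v^{S*}e^{i\Phi} + e^{-i\Phi}\right)}{1 - |B_v^S|^2}.
\end{aligned} \tag{3.141}$$

Similarly the SDE for $B_v^S$ in terms of the new time variable is

$$dB_v^S = -\frac{dv}{1 - v}B_v^S\left(1 + e^{-2i\Phi}B_v^S\right). \tag{3.142}$$

Initial calculations were performed using these equations, but there is a further simplification that can be made. The solution for $B_v^S$ is

$$B_v^S = \frac{1 - v}{(B_0^S)^{-1} - B_v^*}. \tag{3.143}$$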

The only calculations that will be presented in this thesis that were calculated using numerical integration to find $B_v^S$ rather than this solution are those for larger photon numbers for the constant $\varepsilon$ case where the optimum value of $\varepsilon$ was found numerically. It was found that performing the integration rather than using this solution altered the results by less than 0.5%.
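The closed-form solution (3.143) can be compared against direct numerical integration of Eq. (3.142). The sketch below does this with a fourth-order Runge-Kutta step for an arbitrary deterministic phase schedule; the function `local_oscillator_phase`, the initial value `BS0` and the step count are hypothetical choices made only for this check (in an actual simulation $\Phi$ would come from the feedback).

```python
import numpy as np

def local_oscillator_phase(v):
    """Hypothetical smooth phase schedule Phi(v), used only for this check."""
    return 0.8 + 1.5 * v - 0.7 * v**2

def rhs(v, y):
    """Derivatives of (B^S_v, B_v) from Eq. (3.142) and dB_v = -exp(2i Phi) dv."""
    BS, _ = y
    Phi = local_oscillator_phase(v)
    dBS = -BS * (1 + np.exp(-2j * Phi) * BS) / (1 - v)
    dB = -np.exp(2j * Phi)
    return np.array([dBS, dB])

def integrate(BS0, v_end, n_steps):
    """Classical RK4 integration from v = 0 to v = v_end."""
    y = np.array([BS0, 0j], dtype=complex)
    h = v_end / n_steps
    v = 0.0
    for _ in range(n_steps):
        k1 = rhs(v, y)
        k2 = rhs(v + h / 2, y + h / 2 * k1)
        k3 = rhs(v + h / 2, y + h / 2 * k2)
        k4 = rhs(v + h, y + h * k3)
        y = y + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        v += h
    return y

BS0 = 0.6 * np.exp(0.2j)       # arbitrary initial squeezing parameter, |BS0| < 1
v_end = 0.9
BS_numeric, B_numeric = integrate(BS0, v_end, 2000)
BS_formula = (1 - v_end) / (1 / BS0 - np.conj(B_numeric))   # Eq. (3.143)
```

The formula needs only the running value of $B_v$, which is already tracked for the feedback, so using it removes one stochastic-free but stiff integration from the simulation loop.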

It is also possible to consider input states with more general mode-functions in a similar way [35]. In the general case, the input state has a time-varying mode function $u(t)$ that is real and positive, and is normalised so that

$$\int_0^T u(t)\,dt = 1, \tag{3.144}$$

where $T$ is the total pulse length. The mode function gives the systematic variation of $\langle a\rangle$, i.e.

$$\langle a\rangle = \alpha_t \propto \sqrt{u(t)}, \tag{3.145}$$

if any stochastic variation is ignored. If the photocurrent is defined as in (3.124), then the definitions for $A_t$ and $B_t$ are

$$A_t = \int_0^t e^{i\Phi}\sqrt{u(s)}\,I(s)\,ds, \tag{3.146}$$

$$B_t = -\int_0^t e^{2i\Phi}u(s)\,ds. \tag{3.147}$$

For the case of squeezed states, the mode-function is

$$u(t) = e^{-t}. \tag{3.148}$$

Using this mode-function these definitions are identical to those used in [33].

In [35] there is no factor of $\sqrt{u(s)}$ in the definition of $A_t$. This is because $I(t)$ is defined in a slightly different way. In [35] it is assumed that the local oscillator is varied with the same mode-function, so that

$$|\gamma| = \sqrt{u(t)}\,\beta, \tag{3.149}$$

where $\beta$ is a constant. The photocurrent is then defined as

$$I(t) = \lim_{\delta t\to 0}\,\lim_{\beta\to\infty}\frac{\delta N_+ - \delta N_-}{\beta\,\delta t}. \tag{3.150}$$

This is equivalent to using

$$I(t) = \lim_{\delta t\to 0}\,\lim_{|\gamma|\to\infty}\frac{\delta N_+ - \delta N_-}{|\gamma|\delta t}\times\sqrt{u(t)}. \tag{3.151}$$

As there is a factor of $\sqrt{u(t)}$ here, there is no need for it in the definition of $A_t$ used in [35].

The mode-function can be removed by redefining $I(t)$ as

$$I(t) = \lim_{\delta t\to 0}\,\lim_{|\gamma|\to\infty}\frac{\delta N_+ - \delta N_-}{|\gamma|\sqrt{u(t)}\,\delta t}, \tag{3.152}$$

and changing the time variable to

$$v = \int_0^t u(s)\,ds. \tag{3.153}$$

With these changes of variables, $A_v$ and $B_v$ are now given by

$$A_v = \int_0^v e^{i\Phi}I(u)\,du, \tag{3.154}$$

$$B_v = -\int_0^v e^{2i\Phi}\,du, \tag{3.155}$$
and $I(t)$ simplifies to

$$I(v)dv = 2\mathrm{Re}\left[\alpha_v e^{-i\Phi(v)}\right]dv + dW(v), \tag{3.156}$$

where

$$\alpha_v = \alpha_t/\sqrt{u(t)}. \tag{3.157}$$

Here $\alpha_v$ may vary stochastically, but should have no systematic variation. Note that for $u(t) = e^{-t}$, these changes of variables are identical to those used previously to remove the exponential factors.

An additional complication to the numerical technique is that rather than using the feedback in the form

$$\varphi_v = \arg\left(C_v^{1-\varepsilon(v)}A_v^{\varepsilon(v)}\right), \tag{3.158}$$

the form used was

$$\varphi_v = \arg\left[C_v\left(\frac{A_v}{C_v}\right)^{\varepsilon(v)}\right]. \tag{3.159}$$

These forms are almost always equivalent; however, for some cases different results are obtained. If the simple form (3.158) is used, this tends to bias the feedback phases towards zero. The reason for this is that fractional powers generally have multiple solutions, and the program does not necessarily give the appropriate one.

To demonstrate this, consider for example the case where $\varepsilon = \tfrac{1}{2}$. If the phases of $A_v$ and $C_v$ are $2\pi/3$ and $-2\pi/3$, then clearly the phase estimate that we want is halfway between these, at $\varphi_v = \pi$. The feedback in the form (3.158) will give

$$\begin{aligned}
\varphi_v &= \arg\left[\left(e^{-i2\pi/3}\right)^{0.5}\left(e^{i2\pi/3}\right)^{0.5}\right] \\
&= \arg\left(e^{-i\pi/3}e^{i\pi/3}\right) \\
&= \arg 1 \\
&= 0.
\end{aligned} \tag{3.160}$$

What has happened is that the power of 0.5 has rotated the phases towards zero, so that multiplying $C_v^{0.5}$ and $A_v^{0.5}$ together has yielded a phase of zero. On the other hand, if we use the form given by (3.159), we find

$$\begin{aligned}
\varphi_v &= \arg\left[e^{-i2\pi/3}\left(e^{-i2\pi/3}\right)^{0.5}\right] \\
&= \arg\left(e^{-i2\pi/3}e^{-i\pi/3}\right) \\
&= \arg(-1) \\
&= \pi,
\end{aligned} \tag{3.161}$$

which is the correct result. I will usually give the feedback in the simple form (3.158), even though it is necessary to use (3.159) in actual calculations.
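The branch-cut issue in this example is easy to reproduce with principal-value complex powers. The following sketch uses NumPy, whose complex power takes the principal branch; the function names are hypothetical, and it is not a claim about the language or library used for the thesis calculations.

```python
import numpy as np

def feedback_simple(A, C, eps):
    """Feedback phase in the simple form (3.158): arg(C^(1-eps) * A^eps)."""
    return np.angle(C ** (1 - eps) * A ** eps)

def feedback_ratio(A, C, eps):
    """Feedback phase in the form (3.159): arg(C * (A/C)^eps)."""
    return np.angle(C * (A / C) ** eps)

# The example from the text: phases of A and C are 2*pi/3 and -2*pi/3, eps = 1/2
A = np.exp(2j * np.pi / 3)
C = np.exp(-2j * np.pi / 3)
phi_simple = feedback_simple(A, C, 0.5)   # close to 0: biased towards zero
phi_ratio = feedback_ratio(A, C, 0.5)     # close to +pi or -pi, which are the same phase
```

Because rounding can place the result on either side of the negative real axis, `phi_ratio` may come out as either $+\pi$ or $-\pi$; these represent the same phase.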



3.4 Time Steps 

Before performing the calculations, it is important to estimate the inaccuracy introduced by the finite time steps. This is necessary to determine how many time steps are required in order to keep the error due to this factor negligible. In Ch. [I| it is shown that the minimum phase variance when there is a time delay of $\tau$ is $\tau/(8\bar{n})$. It is reasonable that there should be a similar scaling for the error due to the finite time step.

In order to estimate the error due to the time step, the phase variance was determined numerically for mark II measurements on squeezed states optimised for minimum intrinsic phase variance. Calculations were performed for mean photon numbers of approximately 122, 432, 1577 and 5877. For each photon number calculations were performed simultaneously for $2^m$ intervals, with several different values of $m$. For photon numbers up to 1577, the value of $m$ was varied from 7 to 16, and for $\bar{n} = 5877$, $m$ was varied from 8 to 17. For each time delay $2^{14}$ samples were used.
In order to minimise the error between the calculations with different numbers of intervals, the random numbers for the smaller numbers of intervals were determined by combining the random numbers for the larger numbers of intervals. Specifically, for a step size of $\delta v$, $dW$ was replaced in the integration with $\delta W$, which has a Gaussian distribution with a variance of $\delta v$. For the integration with twice the step size, $\delta v' = 2\delta v$, adjacent increments $\delta W$ were added to give the new increment $\delta W'$, with variance $2\delta v$.
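The pairing of adjacent increments can be sketched as follows. This is a minimal illustration with a hypothetical helper `coarsen_increments` and an arbitrary number of intervals; it only shows how the same noise realisation is reused at successively doubled step sizes.

```python
import numpy as np

def coarsen_increments(dW_fine):
    """Sum adjacent pairs of Wiener increments, giving increments for twice the step size."""
    return dW_fine[0::2] + dW_fine[1::2]

rng = np.random.default_rng(3)
m = 10                                   # 2**m intervals on the finest grid (illustrative)
dv = 2.0 ** (-m)
dW = rng.normal(0.0, np.sqrt(dv), 2 ** m)

# Build the increments for step sizes dv, 2*dv, 4*dv and 8*dv from the same realisation
levels = [dW]
for _ in range(3):
    levels.append(coarsen_increments(levels[-1]))
```

Every level sums to the same total Wiener displacement, so differences between results at different step sizes reflect the discretisation rather than independent noise.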



The results for mean photon numbers of 122, 432, 1577 and 5877 are shown in Figs 3.4, 3.5, 3.6 and 3.7, respectively. In these figures the straight lines shown are based on weighted linear regressions. The slopes found for each of these graphs were

$$\frac{0.180 \pm 0.010}{\bar{n}}, \qquad \frac{0.185 \pm 0.009}{\bar{n}}, \qquad \frac{0.205 \pm 0.009}{\bar{n}}, \qquad \frac{0.230 \pm 0.012}{\bar{n}}. \tag{3.162}$$

These uncertainties are based on the uncertainties of the individual points. As the same set of random numbers was used for each time step size, it would be expected that the uncertainty based on the deviation of the points from the line would be smaller. This is true for the smaller photon numbers, but not for the larger photon numbers where the points tend to fit the line poorly.

Note that there tends to be some nonlinearity in the results for the larger photon numbers. This
is most obvious in Fig. 3.7, where the variances for the larger time steps are much higher than would
be expected based on a linear extrapolation from the results for small time steps. In addition the
error bars are larger for these time intervals. The reason for this is that there are a small number 
of results with large errors in the data sets on which these points are based. This causes both the 
variance and the uncertainty in the variance to be larger. This is also true to a lesser extent for the 
smaller photon numbers. 

We are only interested in the phase variance for small time steps here, as we wish to use time 
steps small enough to keep the error due to the finite step size below about 1%. For small step sizes,
the effect of this type of nonlinearity will be that the error is less than that expected based on the
linear fits for larger time steps. For example, if the last two points in Fig. 3.7 are omitted, the slope
obtained is $0.13/\bar n$.

There is a fair amount of variation between the slopes found for the different photon numbers, 
and for different ranges of the linear fit. Nevertheless, the purpose of these calculations is not to 
find an exact result, but rather to place an upper limit on the error due to the finite step size. In 
each case, we find that 

\[ \Delta V(\phi) < \frac{\delta v}{4\bar n}. \tag{3.163} \]

This means that if we use time steps of

\[ \delta v = \frac{\bar n V(\phi)}{25}, \tag{3.164} \]
where $V(\phi)$ is the total phase variance, the error in the phase variance should be less than 1%.
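As a rough illustration of how a step size follows from the fitted slopes, the sketch below assumes the error grows linearly, $\Delta V \approx c\,\delta v/\bar n$, and solves for the step that keeps the error below a chosen fraction of the total variance. The function name and the example variance are hypothetical; the slope coefficients are the fitted values quoted above, and the most pessimistic one (fit plus its uncertainty) is used.

```python
import math


def max_time_step(total_variance, mean_photon_number, slope_coeff,
                  max_fraction=0.01):
    """Largest dv satisfying slope_coeff * dv / nbar <= max_fraction * V."""
    if min(total_variance, mean_photon_number, slope_coeff, max_fraction) <= 0:
        raise ValueError("all arguments must be positive")
    return max_fraction * total_variance * mean_photon_number / slope_coeff


# Fitted slope coefficients (value, uncertainty) for each mean photon number.
FITTED_SLOPES = {122: (0.180, 0.010), 432: (0.185, 0.009),
                 1577: (0.205, 0.009), 5877: (0.230, 0.012)}

# Conservative coefficient: the largest fitted value plus its uncertainty.
worst_coeff = max(c + err for c, err in FITTED_SLOPES.values())

# Hypothetical total phase variance, chosen only to show the call.
example_dv = max_time_step(total_variance=1.0e-6, mean_photon_number=1577,
                           slope_coeff=worst_coeff)
```

Any coefficient at or above the fitted slopes gives a step size that errs on the safe side, at the cost of more integration steps.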



3.5 Naive Constant $\epsilon$ Feedback

The first method that I will consider is that where the values of $\epsilon$ are the optimum values predicted
by the simplified theory:

e= (2a op te 2C ^ f . (3.165) 

For the results presented here, the states used are squeezed states optimised for minimum intrinsic
phase variance. For these states, the theoretical limit to the total phase variance is twice the intrinsic 






CHAPTER 3. OPTIMUM ADAPTIVE DYNE MEASUREMENTS 




Figure 3.4: The phase variance for mark II measurements on optimised squeezed states with a mean 
photon number of 122 with various time step sizes. The crosses are the numerical results, and the 
continuous line is that fitted by a linear regression. 




Figure 3.5: The phase variance for mark II measurements on optimised squeezed states with a mean 
photon number of 432 with various time step sizes. The crosses are the numerical results, and the 
continuous line is that fitted by a linear regression.

Figure 3.6: The phase variance for mark II measurements on optimised squeezed states with a mean
photon number of 1577 with various time step sizes. The crosses are the numerical results, and the 
continuous line is that fitted by a linear regression. 




Figure 3.7: The phase variance for mark II measurements on optimised squeezed states with a mean 
photon number of 5877 with various time step sizes. The crosses are the numerical results, and the 
continuous line is that fitted by a linear regression.

Figure 3.8: The variance for phase measurements using the predicted optimum value of $\epsilon$ as a ratio
to the theoretical limit.



phase variance. The variance for mark II measurements (on general states optimised for minimum 
phase variance under these measurements) does not exceed this limit until a photon number of 
about 140. This does not contradict the theory, because the theoretical limit is only expected to be 
accurate for large photon numbers. Because of this, the improved phase measurement schemes will 
only be considered for photon numbers above about 140. 

The results using this predicted value of $\epsilon$ are shown as a ratio to the theoretical limit (of twice the
intrinsic phase variance) in Fig. 3.8, and as a ratio to the phase variance for mark II measurements
in Fig. 3.9. For the results shown in these figures, 2 samples were used. For moderate photon
numbers above 140 this feedback does give an improvement over mark II measurements, and is close
to the theoretical limit. This is true only over a small range of photon numbers, and for photon 
numbers over about 5000 the phase variance is significantly over the theoretical limit. In fact, for 
photon numbers above about 160000 this feedback technique gives larger phase variances than the 
mark II technique. The best improvement in the phase variance is at a photon number of about 
20000, where the phase variance is about 36% of the mark II phase variance. 

The reason for these poor results is that the values of $|A|$ given by this feedback technique are
too high. If we plot the mean values of $|A|^2$ (see Fig. 3.10), we see that although they start out close
to $\epsilon$, the agreement gradually gets worse and worse. Now recall from the above results for coherent
states that although there is agreement with theory over a large range of values of $\epsilon$, for smaller
values of $\epsilon$, larger photon numbers are required in order to have good agreement with theory.

In fact, the photon number required scales at least as $\epsilon^{-2}$. Conversely, we can say that the
minimum value of $\epsilon$ for good agreement with the theory scales roughly as $\bar n^{-1/2}$. For the predicted
values of $\epsilon$ above, we find that $\epsilon$ scales as $\log^2 \bar n/\bar n$. Clearly this means that for the larger photon
numbers, the value of $\epsilon$ will be too small for good agreement with theory.



3.6 Optimised Constant $\epsilon$ Feedback

In order to obtain smaller values of $|A|$, and therefore phase variances that are closer to the theoretical
limit, we can use different values of $\epsilon$. We would at first expect that reducing $\epsilon$ would give smaller
values of $|A|$; however, this is not necessarily the case. As demonstrated in Fig. 3.11, as the value

Figure 3.9: The variance for phase measurements using the predicted optimum value of $\epsilon$ as a ratio
to the phase variance for mark II measurements on optimum input states.





Figure 3.10: The mean values of $|A|^2$ obtained for phase measurements using the predicted optimum
value of $\epsilon$. The analytic result of $\langle|A|^2\rangle = \epsilon$ is shown as the continuous line and the numerically
obtained values of $\langle|A|^2\rangle$ are shown as crosses.

Figure 3.11: The mean values of $|A|^2$ for measurements on coherent states with $\arg(C_v^{1-\epsilon}A_v^{\epsilon})$
feedback. The results for $\alpha = 4$ are shown in dark blue, for $\alpha = 8$ in green, for $\alpha = 16$ in red, for $\alpha = 32$
in light blue, for $\alpha = 64$ in purple, for $\alpha = 128$ in yellow, for $\alpha = 256$ in black, for $\alpha = 512$ as a
dashed dark blue line, and for $\alpha = 1024$ as a dashed green line.



of $\epsilon$ is decreased for a fixed photon number, the value of $|A|^2$ decreases initially, but it reaches a
minimum, and then as $\epsilon$ is decreased further $|A|^2$ increases. (This figure is for the same data as
Fig. H)

As we are in the region where the approximate theory breaks down, rather than using a predicted
value of $\epsilon$, it is better to vary $\epsilon$ to determine which value gives the smallest phase variance for each
mean photon number. The phase variances obtained by this method are plotted in Fig. 3.12 as a
ratio to the theoretical limit, and in Fig. 3.13 as a ratio to the mark II phase variance. For these
results $2^{11}$ samples were used for moderate photon numbers, and $2^{9}$ samples were used for photon
numbers above $2 \times 10^{6}$.

Again the phase variance is close to the theoretical limit for photon numbers up to about 5000, 
but beyond this the phase variance is greater and greater than the theoretical limit. Unlike the 
previous case, however, the phase variance continues to get smaller as compared to the mark II 
phase variance. For the maximum photon number calculations have been performed for, the phase 
variance is less than 10% of the mark II phase variance. 

If we plot the ratio of the numerically determined optimum value of $\epsilon$ to the analytically predicted
value (see Fig. 3.14), we find that for smaller photon numbers the optimum value of $\epsilon$ is less than the
predicted value. As the photon number is increased, the optimum value of $\epsilon$ is greater and greater as
a ratio to the predicted value. The actual value of $\epsilon$ is still decreasing with photon number, though,
as demonstrated in Fig. 3.15.



These results indicate that although this phase feedback scheme is not achieving the theoretical
limit, it is achieving a better scaling than for mark II measurements. Assuming that $h(\bar n) \approx c\bar n^{-p}$,
a numerical fit was performed on the data to determine $c$ and $p$. Recall from Sec. 2.3.3 that the
approximate phase variance for general measurements on squeezed states is



V{4>) « V(^can) + 2c7T 



1 



P(P + 1) 



2n 



(3.166) 



Rather than simply assuming that the introduced phase variance was $2c\bar n^{-p}$, this expression was

Figure 3.13: The variance for phase measurements using the numerically determined optimum value
of $\epsilon$ as a ratio to the mark II phase variance.

Figure 3.14: The numerically determined optimum values of $\epsilon$ as ratios to the analytically predicted
optimum values of $\epsilon$.

Figure 3.15: The numerically determined optimum values of $\epsilon$ for $\arg(C_v^{1-\epsilon}A_v^{\epsilon})$ feedback.

Figure 3.16: The phase variance for $\arg(C_v^{1-\epsilon}A_v^{\epsilon})$ feedback using the numerically determined
optimum values of $\epsilon$. The numerical results are shown as crosses and the continuous line is that fitted
to the data. The mark II phase variance (dash-dotted line) and theoretical limit (dotted line) are
also shown.

used in order to obtain more accurate results. In addition, only data points for photon numbers
above about 10000 were used, as the data for small photon numbers gave a poor approximation of
the asymptotic result. It was found that the best fit was for $c = 0.095 \pm 0.008$ and $p = 1.685 \pm 0.007$.
The data and the fitted line along with the mark II case and the theoretical limit are shown in
Fig. 3.16. As can be seen, the power law is a very good fit for the data.
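The idea of fitting a power law $h(\bar n) \approx c\bar n^{-p}$ can be illustrated with a least-squares line in log-log coordinates. This is a simplified sketch: the fit described above used the fuller expression for the phase variance rather than a bare power law, and the data below are synthetic values generated from the quoted best-fit parameters, not the thesis results. The function name is an assumption.

```python
import math


def fit_power_law(n_values, h_values):
    """Least-squares fit of log h = log c - p log n; returns (c, p)."""
    xs = [math.log(n) for n in n_values]
    ys = [math.log(h) for h in h_values]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope


# Synthetic, noise-free data built from c = 0.095 and p = 1.685.
n_values = [1e4, 1e5, 1e6, 1e7]
h_values = [0.095 * n ** -1.685 for n in n_values]
c_fit, p_fit = fit_power_law(n_values, h_values)
```

With real data the points would carry uncertainties, so a weighted fit would be the more natural choice.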



Note that, because the value of $\epsilon$ used depends on the photon number, this phase measurement
scheme cannot be described by a single POM. Instead, the actual POM and $H$ matrix will depend on
the photon number. Nevertheless, it is reasonable to consider these measurements as approximately
equivalent to a measurement scheme with a single POM, and the variation of $h(\bar n)$ as given above.



3.7 Time Dependent $\epsilon$
3.7.1 Simple Method

In order to improve on this result, the remaining alternative is to vary $\epsilon$ during the measurement. As
the approximate theory above does not give any information about how $\epsilon$ should be varied during
the measurement, trial and error was used initially. It was found that an improvement over the
constant $\epsilon$ case could be obtained if $\epsilon$ was increased linearly during the measurement. Even better
improvements were obtained when $\epsilon$ was increased proportionally to $v^2$ or $v^3$, or some higher power
of time.

A possible reason why $\epsilon$ should be increased during the measurement can be deduced from
Eq. (3.28). In the case that we are considering a squeezed state, rather than a coherent state, the
phase of $\alpha_v$ is now time dependent. The phase estimate $\arg C_v$ gives an estimate of the initial phase,
and takes no account of the variation of the phase of the running value $\alpha_v$. This means that when
the variance of $\arg\alpha_v$ is large, the intermediate phase estimate may not be between the phase of $\alpha_v$
and $A_v$, even though it is between the phases of $C_v$ and $A_v$.

If the variance of the phase estimate is kept sufficiently above the variance of $\arg\alpha_v$ during the
measurement, then this problem should be corrected. Therefore a statistical phase feedback scheme






Figure 3.17: The phase variance as a ratio to the theoretical limit for the feedback scheme where $\epsilon$
is chosen such that the variance of the intermediate phase estimate is no less than a fixed multiple
of the variance of $\arg\alpha_v$.



was used, where at each time the value of $\epsilon$ was chosen such that the variance in the phase estimate
was at or above some fixed multiple of the variance of $\arg\alpha_v$. These variances were determined
numerically from a large number ($2^{12}$) of simultaneous calculations. The best multiple to use cannot
be predicted, and was determined numerically from these samples.

The results obtained using this method are shown as a ratio to the theoretical limit of twice the
phase variance of an optimised squeezed state in Fig. 3.17. The results are far better than for the
case where $\epsilon$ is kept constant, with the phase variance being below or close to the theoretical limit
for all the photon numbers tested. Even for the largest photon number for which it was feasible to
perform calculations, the phase variance is less than 8% above the theoretical limit.

The optimum limiting ratio between the phase variances turns out to be different for the different
photon numbers. The optimum values obtained are shown in Fig. 3.18. The minimum value is around
2.5, which is about what would be expected in order to prevent the phase estimates being outside
the interval between $\arg\alpha_v$ and $\arg A_v$. The optimum limiting ratio can be much higher, however,
and generally increases with photon number.

Another factor is that initially $\arg C_v$ will be a poor phase estimate, and therefore the phase
estimate may not be between the phase of $\alpha_v$ and $A_v$ if $\epsilon$ is too small. This would seem to indicate
that $\epsilon$ should be large initially to take account of this. It was found that this in fact made the
measurement poorer. This indicates that the situation is not as simple as indicated in the analysis
above.

This phase measurement scheme, although it provides measurements at or very close to the
theoretical limit, is unsatisfactory because the values of $\epsilon$ are determined statistically from a large
number of measurements, rather than determined from the individual values of $A_v$, $B_v$ and $v$. It is
possible to determine the variation of $\epsilon$ numerically in this way for a particular photon number, and
use this variation to obtain results close to the theoretical limit for independent measurements. This
requires knowledge of the photon number, however, and the variation takes a long time to calculate.

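Figure 3.18: The optimum limiting ratio between the variance of the phase estimate and the variance
of $\arg\alpha_v$.

In order to determine the values of $\epsilon$ to use in a simple way, we can consider the approximate
variation of the variance of $\arg\alpha_v$. Taking the system phase to be zero, and making the approximations
that $e^{i\Phi} \approx i$ and $B_v^S$ is approximately real, Eq. ( p,14l|) simplifies to

\[ d\alpha_v \approx -\frac{i}{1-v}\,\frac{B_v^S\,dW}{1 + B_v^S}. \tag{3.167} \]

Using the approximation $B_v^S \approx 1$, this becomes

\[ d\alpha_v \approx -\frac{i}{1-v}\,\frac{dW}{2}. \tag{3.168} \]

Using this to determine the variation in the phase of $\alpha_v$ gives

\begin{align}
d\arg\alpha_v &= \mathrm{Im}[d\log\alpha_v] \nonumber \\
&= \mathrm{Im}\left[\frac{d\alpha_v}{\alpha_v} - \frac{(d\alpha_v)^2}{2\alpha_v^2}\right] \nonumber \\
&\approx \mathrm{Im}\left[\frac{-i}{1-v}\,\frac{dW}{2\alpha_v} + \frac{1}{4(1-v)^2}\,\frac{dv}{2\alpha_v^2}\right]. \tag{3.169}
\end{align}

Assuming that $\alpha_v$ remains close to its initial value $\alpha$, which is real, this simplifies to

\[ d\arg\alpha_v \approx \frac{-dW}{2(1-v)\alpha}. \tag{3.170} \]

Using this, the increment in the expectation value of $(\arg\alpha_v)^2$ is

\[ d\langle(\arg\alpha_v)^2\rangle \approx \frac{dv}{4\alpha^2}\,\frac{1}{(1-v)^2}. \tag{3.171} \]

This gives us

\[ \langle(\arg\alpha_v)^2\rangle \approx \frac{1}{4\alpha^2}\,\frac{v}{1-v}. \tag{3.172} \]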
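The step from the increment in the variance of $\arg\alpha_v$ to its accumulated value can be checked numerically. The sketch below assumes the increment is $dv/[4\alpha^2(1-v)^2]$ and that the variance is zero at $v = 0$, in which case the integral is $v/[4\alpha^2(1-v)]$; the function names and the example values are illustrative only.

```python
import math


def accumulated_variance(v, alpha, steps=20000):
    """Midpoint-rule integral of du / (4 alpha^2 (1 - u)^2) from 0 to v."""
    h = v / steps
    total = 0.0
    for j in range(steps):
        u = (j + 0.5) * h
        total += h / (4.0 * alpha ** 2 * (1.0 - u) ** 2)
    return total


def closed_form_variance(v, alpha):
    """Closed form of the same integral, assuming zero variance at v = 0."""
    return v / (4.0 * alpha ** 2 * (1.0 - v))


# Hypothetical values: halfway through the measurement with alpha = 10.
numeric = accumulated_variance(0.5, 10.0)
analytic = closed_form_variance(0.5, 10.0)
```

The divergence as $v \to 1$ is the feature that matters for the feedback: the phase of $\alpha_v$ becomes increasingly uncertain towards the end of the measurement.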
During the course of the measurement we wish to keep the variance of the phase estimate at
some constant, $k$, times the variance of the phase of $\alpha_v$. Therefore the variance in the phase estimate
should be

\[ \langle\hat\varphi_v^2\rangle = \frac{k}{4\alpha^2}\,\frac{v}{1-v}. \tag{3.173} \]

As the simplest approximation, the variance of $\arg A_v$ can be assumed to be the same as for the
mark I case, where it is $1/(4\alpha\sqrt{v})$. Approximating the phase estimate as $\epsilon(v)\arg A_v$, this gives

\[ \frac{\epsilon^2(v)}{4\alpha\sqrt{v}} = \frac{k}{4\alpha^2}\,\frac{v}{1-v}. \tag{3.174} \]

This implies that the value of $\epsilon(v)$ should be

\[ \epsilon(v) = \sqrt{\frac{k}{\alpha}}\,\frac{v^{3/4}}{\sqrt{1-v}}. \tag{3.175} \]

This approximation is a bit too simplistic, as the variance of $\arg A_v$ increases as $\epsilon$ is decreased.
Alternatively, using the result given in Eq. (3.41), we find



k^ 2 



< V ^{a) (I^F (3 - 176) 
This result is not necessarily any more accurate than the previous one, however, as the solution
(3.41) is only accurate for constant $\epsilon$.

The full differential equations taking into account the time dependent $\epsilon$ are extremely difficult
to solve, and do not appear to have a simple solution of the above form. Therefore we will instead
consider the numerical results for the full calculation. The time dependence of $\epsilon(v)$ for a mean
photon number of about 22255 is shown in Fig. 3.19. The analytic approximation shown was found
by trial and error, and is

The dividing factor of 350 here is around twice the value of $\alpha$.

The common features of the above expressions for $\epsilon$ are a factor of $v$ to some power in the
numerator, and factors of $1 - v$ and $\alpha$ to some power in the denominator. Various expressions with
these features were tested, but the one that gave the best results was

\[ \epsilon(v) = \frac{1}{\alpha}\sqrt{\frac{v}{1-v}}. \tag{3.178} \]

Note that this is very similar to the above analytic approximation for the exact results, except that
$1 - v$ is taken to the power of 0.5 rather than 0.8 (as well as a difference of a multiplicative factor).

This expression suffers from the slight drawback that it is dependent on the value of $\alpha$, rather
than the experimentally measured quantities. This is easily corrected by using the slightly modified
expression,

\[ \epsilon(v) = \frac{v^2 - |B_v|^2}{|C_v|}\sqrt{\frac{v}{1-v}}. \tag{3.179} \]

Here $|C_v|/(v^2 - |B_v|^2)$ is used as an estimator for $\alpha$.
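A minimal sketch of how such a time dependent $\epsilon$ might be evaluated from measured quantities is given below. It assumes the form $\epsilon(v) = \alpha^{-1}\sqrt{v/(1-v)}$, with $\alpha$ replaced by the estimator $|C_v|/(v^2 - |B_v|^2)$; the function names and the test values are hypothetical.

```python
import math


def estimate_alpha(v, B_v, C_v):
    """Estimator |C_v| / (v^2 - |B_v|^2) for the coherent amplitude."""
    denom = v * v - abs(B_v) ** 2
    if denom <= 0.0:
        raise ValueError("requires |B_v| < v")
    return abs(C_v) / denom


def epsilon_known_alpha(v, alpha):
    """Assumed form epsilon(v) = sqrt(v / (1 - v)) / alpha, for 0 < v < 1."""
    if not 0.0 < v < 1.0:
        raise ValueError("requires 0 < v < 1")
    return math.sqrt(v / (1.0 - v)) / alpha


def epsilon_estimated(v, B_v, C_v):
    """Same expression with alpha replaced by its running estimate."""
    return epsilon_known_alpha(v, estimate_alpha(v, B_v, C_v))


# Hypothetical noiseless example: C_v = alpha (v^2 - |B_v|^2) with alpha = 10.
v, B_v = 0.5, 0.3
C_v = 10.0 * (v * v - abs(B_v) ** 2)
eps = epsilon_estimated(v, B_v, C_v)
```

In the noiseless example the estimator returns the true amplitude, so the two expressions agree; with measurement noise they differ, which is the price of using only measured quantities.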



The results for this method are shown in Fig. 3.20 as a ratio to the theoretical limit. As this
figure shows, the results are very close to the theoretical limit, and even for the largest photon
number for which calculations have been performed the phase uncertainty is only about 4% above
the theoretical limit. For these results $2^{11}$ samples were used. This sample size was used for the rest
of the results in this chapter, unless otherwise stated.

If the integration time step is reduced, while keeping the time interval at which the phase
estimates are updated constant, the phase variance converges. If, however, the phase estimates are
updated at smaller and smaller time intervals then the phase variance does not converge. For example
the phase uncertainty for measurements on an optimised squeezed state with a photon number
of 1577 is $1.54 \times 10^{-6}$ if we use the time steps given above. If we use time steps that are a hundred
times smaller, then the phase variance is $1.93 \times 10^{-6}$, and if the time steps are a thousand times
smaller the phase variance is $2.13 \times 10^{-6}$. These results indicate that the phase estimates must be
incremented in finite time intervals for this method to give good results, and the size of the time steps
that should be used depends on the photon number. The phase variance is not strongly dependent
on these time steps, however, and only an order of magnitude estimate of the photon number would
be required.

Figure 3.20: Variance for phase measurements with a time dependent $\epsilon$ given by Eq. (3.179) plotted
as a function of the photon number of the input state. The phase variance is plotted as a ratio to
the theoretical minimum phase variance (i.e. twice the intrinsic phase variance).

Figure 3.21: Values of $\xi_R$ and $\bar n^P$ (calculated from $A$ and $B$) resulting from measurements on squeezed
states of various mean photon numbers. The variation of $\xi^P$ with $\bar n^P$ for optimum squeezed states
is also plotted (continuous line).



3.7.2 Evaluation of Method 

A problem with determining the phase variance by the method above is that for highly squeezed
states (that are close to optimised for minimum phase variance), a significant contribution to the
phase variance is from low probability results around $\pi$. In obtaining numerical results the actual
phase variance for the measurement will tend to be underestimated because the results from around
$\pi$ are obtained too rarely for good statistics. It would require an extremely large number of samples
to estimate this contribution. However, we can estimate it non-statistically as follows.

Recall that in order to have a measurement that is close to optimum, the multiplying factor
$Q'(\bar n^P, \xi^P)$ should give values of $\xi^P$ for each $\bar n^P$ that are close to optimised for minimum phase
uncertainty. To test this for the phase measurement scheme described above, the $\bar n^P$ and $\xi^P$ were
determined from the values of $A$ and $B$ from the samples. The resulting data along with the line
for optimum $\xi$ are plotted in Fig. 3.21. The imaginary part of $\xi$ should be zero for optimum
measurements, and is small for these results. Therefore in Fig. 3.21 the real part, $\xi_R$, is plotted. As
can be seen, the vast majority of the data points are below the line, indicating greater squeezing
than optimum. (There are more points above the line for large $\bar n$; more will be said about this later.)
This means that if the low probability results around $\pi$ are taken into account the phase variance
for these measurements will be above the theoretical limit.

I will firstly consider the effect of variations in the modulus of $\xi^P$, leaving consideration of error
in the phase till later. In order to estimate how far above the theoretical limit the actual phase
variance is, we can make a quadratic approximation to the expression for the phase variance. From
[1451 the expression for the intrinsic phase variance of a squeezed state is, for real $\zeta$,



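\[ V(\phi) \approx \frac{e^{2\zeta}}{4\bar n} + \frac{1}{4\bar n^2} + 2\,\mathrm{erfc}\left(\sqrt{2\bar n}\,e^{\zeta}\right). \tag{3.180} \]

Taking the derivative with respect to $\zeta$ gives

\[ \frac{d}{d\zeta}V(\phi) \approx \frac{e^{2\zeta}}{2\bar n} - 4e^{\zeta}\sqrt{\frac{2\bar n}{\pi}}\,e^{-2\bar n e^{2\zeta}}. \tag{3.181} \]

Taking the second derivative we obtain

\[ \frac{d^2}{d\zeta^2}V(\phi) \approx \frac{e^{2\zeta}}{\bar n} - 4e^{\zeta}\sqrt{\frac{2\bar n}{\pi}}\,e^{-2\bar n e^{2\zeta}} - 4e^{\zeta}\sqrt{\frac{2\bar n}{\pi}}\left(-4\bar n e^{2\zeta}\right)e^{-2\bar n e^{2\zeta}}. \tag{3.182} \]

Using the fact that the first derivative (3.181) is zero for minimum phase variance gives

\[ 4e^{\zeta}\sqrt{\frac{2\bar n}{\pi}}\,e^{-2\bar n e^{2\zeta}} = \frac{e^{2\zeta}}{2\bar n}. \tag{3.183} \]

Using this result, the expression for the second derivative simplifies to

\[ \frac{d^2}{d\zeta^2}V(\phi) \approx \frac{e^{2\zeta}}{\bar n} - \frac{e^{2\zeta}}{2\bar n} + 4\bar n e^{2\zeta}\,\frac{e^{2\zeta}}{2\bar n} = \frac{e^{2\zeta}}{2\bar n}\left(1 + 4\bar n e^{2\zeta}\right). \tag{3.184} \]

This means that for values of $\zeta$ close to optimum the increase in the phase variance over the optimum
value is, with $n_0 = \bar n e^{2\zeta}$,

\[ \Delta V(\phi) \approx (\Delta|\zeta|)^2\,\frac{n_0}{4\bar n^2}\,(1 + 4n_0). \tag{3.185} \]

Here the absolute value of $\zeta$ has been used for greater generality.

The main contribution to the phase uncertainty is $n_0/(4\bar n^2)$, so the increase in the phase
uncertainty as a ratio to the minimum phase uncertainty is approximately

\[ \frac{\Delta V(\phi)}{V_{\min}(\phi)} \approx (\Delta|\zeta|)^2\,(1 + 4n_0). \tag{3.186} \]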


Note that as the photon number is increased, $n_0$ increases roughly as $\log\bar n$. This means that in
order for the percentage increase in the excess phase uncertainty to remain limited, the error in $|\zeta|$
must decrease with photon number.

Now I will use the superscript $P$ to indicate specifically the squeezed state in the POM. Since
$|\zeta^P| = |\xi^P|$, we obtain

\[ \Delta V(\phi) \approx (\Delta|\xi^P|)^2\,\frac{n_0}{4\bar n^2}\,(1 + 4n_0). \tag{3.187} \]

Here I have used the values of $n_0$ and $\bar n$ for the input state, rather than the squeezed state in the
POM. As the input states considered here are optimum squeezed states, the average values of these
quantities for the squeezed state in the POM should be close to those for the input state. For a more
general input state we would still use the value of $\bar n$ for the input state, but the value of $n_0$ should
be chosen as the corresponding value for an optimum squeezed state with mean photon number $\bar n$.

This estimate indicates that the actual phase variance for the measurement scheme described
above can be significantly larger than the intrinsic phase variance. For example, for a mean photon
number of about 332000, the rms deviation of $|\xi^P|$ from the optimum value is only about 0.16, but
a squeezed state with $|\xi^P|$ differing this much from optimum will have a phase variance more than
twice the optimum value. This indicates that if the low probability results around $\pi$ are taken into
account, the introduced phase variance is actually more than twice the theoretical limit.

Now I will estimate the contribution from error in the phase (rather than the modulus) of $\xi^P$.
Recall that in attempting to estimate the phase of a state, we measure quadratures close to $\pi/2$.
For the $\pi/2$ quadrature

\[ X_{\pi/2} = -i(a - a^\dagger), \tag{3.188} \]

the expectation value is

\[ \langle X_{\pi/2}\rangle = 2|\alpha|\sin\varphi. \tag{3.189} \]

For small $\varphi$, this means that

\[ \langle X_{\pi/2}\rangle \approx 2|\alpha|\varphi. \tag{3.190} \]

Canonical measurements can be considered by assuming that $\alpha$ is known, and that the $\pi/2$ quadrature
is proportional to the phase. This would mean that a measurement of $X_{\pi/2}$ is equivalent to a
direct measurement of the phase.

For simplicity the system phase will now be taken to be zero. The estimated phase is then
proportional to the measured value of $X_{\pi/2}$, and the variance in the phase estimate will be proportional
to the variance in the $\pi/2$ quadrature:

\[ \langle\Delta X_{\pi/2}^2\rangle \approx 4\bar n\,\langle\Delta\phi^2\rangle. \tag{3.191} \]

Here the approximation $\bar n \approx |\alpha|^2$ has been used. This is reasonably accurate for the states that are
considered here. Now recall that the uncertainty in this quadrature for a squeezed state is

\[ \langle\Delta X_{\pi/2}^2\rangle = e^{2r}\cos^2\tfrac{1}{2}\phi_\zeta + e^{-2r}\sin^2\tfrac{1}{2}\phi_\zeta, \tag{3.192} \]

where $r$ and $\phi_\zeta$ are the magnitude and phase of $\zeta$.
Therefore the uncertainty in the phase is

\[ \langle\Delta\phi^2\rangle = \frac{e^{2r}\cos^2\tfrac{1}{2}\phi_\zeta + e^{-2r}\sin^2\tfrac{1}{2}\phi_\zeta}{4\bar n}. \tag{3.193} \]

If the phase of $\zeta$ is close to $\pi$ (so $\zeta$ is close to negative real), we can make the approximation

\[ \langle\Delta\phi^2\rangle \approx \frac{e^{-2r} + e^{2r}(\Delta\phi_\zeta)^2/4}{4\bar n}, \tag{3.194} \]

where $\Delta\phi_\zeta = \phi_\zeta - \pi$. The first term in the numerator is identical to the first term for the intrinsic
phase variance of a squeezed state. Clearly the second term is the excess phase variance due to the
error in the phase of $\xi$. Therefore the extra phase variance due to error in the phase of $\zeta$ is given by

\[ \frac{e^{2r}}{16\bar n}(\Delta\phi_\zeta)^2. \tag{3.195} \]
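The small-angle step can be checked numerically. The sketch below assumes the phase variance is the $\pi/2$ quadrature variance $e^{2r}\cos^2(\phi_\zeta/2) + e^{-2r}\sin^2(\phi_\zeta/2)$ divided by $4\bar n$, and compares it with the expansion about $\phi_\zeta = \pi$; the function names and the test values are illustrative.

```python
import math


def phase_variance_exact(r, phi_zeta, nbar):
    """Quadrature variance of the squeezed state divided by 4 nbar."""
    half = 0.5 * phi_zeta
    return (math.exp(2.0 * r) * math.cos(half) ** 2
            + math.exp(-2.0 * r) * math.sin(half) ** 2) / (4.0 * nbar)


def phase_variance_small_error(r, delta_phi, nbar):
    """Expansion for phi_zeta = pi + delta_phi with small delta_phi."""
    return (math.exp(-2.0 * r)
            + math.exp(2.0 * r) * delta_phi ** 2 / 4.0) / (4.0 * nbar)


def excess_variance(r, delta_phi, nbar):
    """Extra phase variance attributed to the error in the squeezing phase."""
    return math.exp(2.0 * r) * delta_phi ** 2 / (16.0 * nbar)


# Hypothetical values chosen only to compare the two expressions.
r, delta_phi, nbar = 1.0, 0.01, 1000.0
exact = phase_variance_exact(r, math.pi + delta_phi, nbar)
approx = phase_variance_small_error(r, delta_phi, nbar)
```

The excess term grows as $e^{2r}$, so for strongly squeezed states even a small error in the squeezing phase can matter.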

Note that the error in the phase of $\xi$ is equivalent to the error in the phase of $\zeta$ above, as the
above case is for a system phase of zero. Now using the superscript $P$ to indicate specifically the
squeezed state in the POM, we have

Here we are again using the value of $n_0$ from the input state. Using this estimate on the example
used for the magnitude, with $\bar n \approx 332000$, it can be seen that this is not so much of a problem, with
the introduced phase uncertainty being increased by less than 3% by this factor.

3.7.3 Corrected Method

The problem of the large contribution from low probability results around $\pi$ can be effectively
eliminated by using corrections near the end of the measurement. In order to describe this we
must firstly consider the values of $\alpha^P$ and $\zeta^P$ for intermediate times. From Ref. |kJ, the POM for
intermediate times is

\[ F_v(A_v, B_v) = Q_v(A_v, B_v)\exp\left(\tfrac{1}{2}B_v a^{\dagger 2} + A_v a^\dagger\right)(1 - v)^{a^\dagger a}\exp\left(\tfrac{1}{2}B_v^* a^2 + A_v^* a\right). \tag{3.197} \]

This is mixed, not pure, and it is therefore not possible to simplify it to an expression in terms of
squeezed states, as is the case at the end of the measurement. I will therefore define the values of $\alpha^P$
and $\zeta^P$ by analogy with the definitions at the end of the measurement, but these will not actually
correspond to squeezing parameters for a squeezed state in the POM.
Recall from the introduction that for a coherent state we have

\[ vA_v + B_v A_v^* = \alpha(v^2 - |B_v|^2) + i(v\sigma_v - B_v\sigma_v^*), \tag{3.198} \]

so

\[ \frac{vA_v + B_v A_v^*}{v^2 - |B_v|^2} = \alpha + \frac{i(v\sigma_v - B_v\sigma_v^*)}{v^2 - |B_v|^2}. \tag{3.199} \]

This means that $C_v/(v^2 - |B_v|^2)$ can be used as an estimator for $\alpha$. The situation is more complicated
for squeezed states, as there are extra terms due to the stochastic variation of $\alpha$. It is still possible to
use $C_v/(v^2 - |B_v|^2)$ as an estimator for $\alpha$, however. As $\alpha^P$ is defined by the value of $C_v/(v^2 - |B_v|^2)$
for $v = 1$, it is therefore reasonable to define intermediate values $\alpha_v^P$ by

\[ \alpha_v^P = \frac{A_v v + B_v A_v^*}{v^2 - |B_v|^2}. \tag{3.200} \]

For the case of $\zeta_v^P$, note that $\zeta^P$ is very large when all the feedback phases are close to each
other, and $B$ is correspondingly close to 1. In order for $\zeta_v^P$ to have the same property, the argument
of the atanh function should be close to 1 when the phase estimates are very close together. As
the maximum value $|B_v|$ can have is $v$, this will be the case if $B_v$ is divided by $v$. Using this, the
definition of $\zeta_v^P$ is

\[ \zeta_v^P = -\frac{B_v}{|B_v|}\,\mathrm{atanh}\left(\frac{|B_v|}{v}\right). \tag{3.201} \]

Note that this relation of $\alpha_v^P$ and $\zeta_v^P$ to $A_v$ and $B_v$ is not analogous to the relation of $\alpha_v$ and
$\zeta_v$ to $A_v^S$ and $B_v^S$. The quantities $A^S$ and $B^S$ are defined by analogy to $A_v$ and $B_v$ at time $v = 1$.
The time varying values of $A_v^S$ and $B_v^S$ are found by using the same definition with the time varying
squeezing parameters.

Near the end of the measurement, at each time step the photon number is estimated from the
values of $A_v$ and $B_v$. The estimator used is

\[ |\alpha_v^P|^2 + \sinh^2|\zeta_v^P|. \tag{3.202} \]

I will call this estimator $\bar n_v^P$, so that this variable is analogous to $\bar n^P$. The optimum value of $\xi^P$ is
then estimated using an asymptotic formula based on the result in fj~5f ,

^ = -|log( J" V (3.203) 
Vlog< + A'/ 

where

\[ \Delta' = 2\log 2 - \tfrac{1}{4}\log 2\pi \approx 0.926825. \tag{3.204} \]

Note that this $\Delta'$ differs from the $\Delta$ defined in Eq. (2.136) of Sec. 2.3.1 by 1.5. The optimum value
of $\zeta^P$ will be complex, with a phase dependent on the phase of $\alpha^P$.
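The numerical value quoted for the constant in the asymptotic formula can be reproduced directly, assuming natural logarithms and a coefficient of one quarter on the $\log 2\pi$ term (the coefficient is a reading of the damaged source, chosen because it matches the quoted value).

```python
import math

# Constant from the asymptotic formula: 2 log 2 - (1/4) log(2 pi).
delta_prime = 2.0 * math.log(2.0) - 0.25 * math.log(2.0 * math.pi)

# The alternative coefficient of one half, for comparison only.
delta_half = 2.0 * math.log(2.0) - 0.5 * math.log(2.0 * math.pi)
```

Only the one-quarter coefficient agrees with the quoted value to six decimal places.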
If $\xi_R^P$ (the real part of $\xi^P$) is too far below this optimum value, rather than using the feedback
phase of Sec. 3.7.1, the feedback phase used is

\[ \Phi(v) = \tfrac{1}{2}\arg\left[B_v - v\,\frac{C_v}{C_v^*}\tanh|\zeta^{\rm opt}|\right]. \tag{3.205} \]

Using this feedback phase takes $B_v$ directly towards the optimum value. To see this, note firstly
that the optimum value of $B_v$ is

\[ B_v^{\rm opt} = v\,\frac{C_v}{C_v^*}\tanh|\zeta^{\rm opt}|. \tag{3.206} \]

This can be seen by inverting the definition of $\zeta_v^P$. The factor of $C_v/C_v^*$ makes the phase of this zero
relative to $\alpha^P$.

If we then take the exponential of the feedback phase, we find

\[ e^{2i\Phi} \propto B_v - B_v^{\rm opt}, \tag{3.207} \]

so

\[ dB_v \propto B_v^{\rm opt} - B_v. \tag{3.208} \]

Here the proportionality symbol indicates that the phase is equal, but not necessarily the magnitude.
This demonstrates that this feedback phase takes the value of $B_v$ directly towards the optimum value.
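The way this corrective feedback moves $B_v$ can be illustrated with a single integration step. The sketch assumes that $B_v$ evolves as $dB_v = -e^{2i\Phi}\,dv$ and that the feedback phase is half the argument of $B_v - B_v^{\rm opt}$; the function names and the numerical values are hypothetical.

```python
import cmath
import math


def optimum_B(v, C_v, zeta_opt_modulus):
    """Target value v (C_v / C_v*) tanh|zeta_opt| for B_v."""
    return v * (C_v / C_v.conjugate()) * math.tanh(zeta_opt_modulus)


def correction_phase(B_v, B_opt):
    """Feedback phase Phi with exp(2i Phi) parallel to B_v - B_opt."""
    return 0.5 * cmath.phase(B_v - B_opt)


def step_B(B_v, phi, dv):
    """One step of the assumed evolution dB_v = -exp(2i Phi) dv."""
    return B_v - cmath.exp(2j * phi) * dv


# Hypothetical values near the end of a measurement.
v, dv = 0.95, 0.01
C_v = 2.0 + 1.0j
B_opt = optimum_B(v, C_v, zeta_opt_modulus=1.5)
B_before = 0.5 + 0.2j
B_after = step_B(B_before, correction_phase(B_before, B_opt), dv)
```

Under these assumptions each step shortens the distance to the target by exactly $dv$, provided the step does not overshoot.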

The details of exactly when $\xi_R^P$ is considered too far below optimum can be varied endlessly, but
for the results that will be presented here this alternative phase estimate is used after time $v = 0.9$
and when

\C\ > ICV 1 "" 1 ^ 1 ^- (3-209) 

Using the exponential multiplying factor means that this alternative feedback is only used towards
the end of the measurement. Only considering the alternative feedback in the last 10% of the
measurement is necessary for the smaller photon numbers, where Eq. (3.209) is too weak a restriction.
In Ref. fl5~fj the feedback phase



arg 



B v 



/-<* 



(3.210) 



was also mentioned. It turns out that this is not necessary, and very good results can be obtained
by using (3.205) alone.

It was found that good results were obtained for a wide range of photon numbers when the
value of A used was $10^{-3}$. It is also possible to adjust the value of A individually for the different
photon numbers; however, this only gives marginal improvements. The results for this value of A
are shown in Fig. 3.22. The contribution due to the error in the phase of $\xi^P$ is small for the entire
range considered, below 3%. The contribution due to error in the magnitude of $\xi^P$ is even smaller
for moderate photon numbers, but for the largest photon numbers it rises dramatically.

The reason for this rise is that the above correction only corrects for values of $\zeta^P_R$ that are
below optimum, and for the larger photon numbers many of the uncorrected values of $\zeta^P_R$ are above
optimum (see Fig. 3.21). In order to make the corrections work well, a dividing factor can be used to
bring the uncorrected values below the line. For the second largest photon number tested, of around
$5 \times 10^6$, the best results were obtained when the values of $\varepsilon$ as given by (3.179) were divided by 1.1.
For the largest mean photon number tested, $2 \times 10^7$, the best results were obtained for a dividing
factor of 1.2. The value of A that gave the best results with these dividing factors was $5 \times 10^{-4}$.

The results using these dividing factors and altered value of A for the larger photon numbers are
shown in Fig. 3.23. In this case, the contribution due to error in the magnitude of $\zeta^P$ remains small,
around 1.5%, for the largest photon numbers. The contribution due to the error in the phase of $\zeta^P$
does go up slightly for the largest photon numbers, but it is still well below 3%. The total excess
phase variance due to error in $\zeta^P$ does not exceed about 5% for the entire range of photon numbers
considered.

The indication is that the optimum dividing factor will continue to increase for photon numbers
beyond the maximum for which calculations were performed. Unfortunately, using these dividing
factors means that the photon number must be known beforehand. Nevertheless, as the dividing
factor required increases only very slowly with the photon number, only a rough, order of magnitude,
estimate of the photon number is required.

Figure 3.22: Contributions to the phase uncertainty from error in the magnitude of $\zeta^P$ (continuous
line) and the phase of $\zeta^P$ (dash-dotted line). No dividing factors are used, and A = $10^{-3}$. The
contributions are plotted as a ratio to the theoretical minimum introduced phase uncertainty.

Figure 3.23: Contributions to the phase uncertainty from error in the magnitude of $\zeta^P$ (continuous
line) and the phase of $\zeta^P$ (dash-dotted line). Dividing factors of 1.1 and 1.2 are used for photon
numbers of $5 \times 10^6$ and $2 \times 10^7$, respectively. For these photon numbers A = $5 \times 10^{-4}$, otherwise
A = $10^{-3}$. The contributions are plotted as a ratio to the theoretical minimum introduced phase
uncertainty.

Figure 3.24: The phase variance as estimated using the phase data as a ratio to the theoretical
limit. The continuous line is just the variance of the phase data, the dash-dotted line is the variance
corrected for the low probability results around $\pm\pi$, and the dotted line is the corrected theoretical
limit taking into account the variation in $n^P$.

With this modified technique the phase variance again does not converge as the feedback phase
is updated in smaller and smaller time intervals. The phase variance is less dependent on the time
step with this technique, however. For example, for a mean photon number of 1577 the total phase
variance for measurements on an optimised squeezed state only increases by about 9% as the time
steps are reduced by a factor of 1000. In contrast, the phase variance increases by about 38%
for the uncorrected technique.

As an alternative way of evaluating the results we can again consider the variance of the phase
estimates obtained, as shown in Fig. 3.24. In order to take account of the low probability phase
results with large error, we can add a correction of $1/(4n^2)$. The reason for this correction is that, from
Ref. pis}] , the term 2erfc(v / 2no) is the contribution due to results with large error. From Eq. (2.134), 
for squeezed states near the theoretical limit this term becomes approximately $1/(8n^2)$. We must
add twice this, as phase measurements near the theoretical limit on optimally squeezed states have
a variance twice the intrinsic variance of the squeezed state.

As can be seen in Fig. 3.24, when this correction is included the results are noticeably above
the theoretical limit. The results are on average around 10% above the theoretical limit, which is
slightly more than would be expected from the previous analysis. The problem appears to be that
the results where $n^P$ is small have disproportionately high errors. Even though $\langle n^P \rangle \approx n$, we will
find that

\[ \left\langle \frac{\log n^P + \Delta}{4(n^P)^2} \right\rangle > \frac{\log n + \Delta}{4n^2}. \tag{3.211} \]

This means that the mean phase variance for the states $|\alpha^P, \zeta^P\rangle$ will be higher than that for the
state $|\alpha, \zeta\rangle$, even if they are minimum phase uncertainty squeezed states.

This seems to be an intrinsic problem with this type of measurement, as any state that has
small phase variance will have large uncertainty in the photon number. It is therefore reasonable to
postulate that it is not possible to reduce the introduced phase variance below the mean value of
$(\log n^P + \Delta)/[4(n^P)^2]$, and that this is therefore the actual limit.

When the theoretical limit to the introduced phase variance is corrected based on the values
obtained for $n^P$, the theoretical limit to the total phase variance (i.e. the intrinsic phase variance
plus the limit to the introduced phase variance) is as shown in Fig. 3.24. The phase variance
performs much better when compared with this corrected theoretical limit, and even for the largest
photon numbers is no more than about 5% above it.



3.8 Beyond the Theoretical Limit 

The final question that will be addressed in this chapter is whether it is possible, in some
circumstances, to reduce the introduced phase variance below the theoretical limit. Recall that the
theoretical limit is based on the probability distribution for $A$ and $B$ being given by

\[ P(A,B) \propto |\langle\psi|\alpha^P, \zeta^P\rangle|^2. \tag{3.212} \]

If the phase estimate that is used is $\arg(A + BA^*)$, then this implies that the introduced variance in
this phase estimate is the intrinsic phase variance of the state $|\alpha^P, \zeta^P\rangle$. If the phase estimate used
is not $\arg(A + BA^*)$, however, this limit does not apply.

The main example of this is homodyne measurements. For homodyne measurements, we find
that $A + BA^* = 0$, so it is not possible to use the phase estimate $\arg(A + BA^*)$. Instead, we must
use a phase estimate that relies on prior knowledge of the state. For a homodyne measurement
the local oscillator phase is equal to the system phase plus $\pi/2$. We do not need to consider the
photocurrent as a function of time $I(t)$, as the measurement is essentially just a measurement of the
phase quadrature $X_\Phi$, where

\[ X_\Phi = a e^{-i\Phi} + a^\dagger e^{i\Phi}. \tag{3.213} \]

For arbitrary system phase, the expectation value of the $\pi/2$ quadrature is

\[ \langle X_{\pi/2} \rangle = 2|\alpha|\sin\varphi. \tag{3.214} \]

This means that, provided $|\alpha|$ is known, we can use

\[ \phi = \operatorname{asin}\left( \frac{X_{\pi/2}}{2|\alpha|} \right) \tag{3.215} \]

as a phase estimate. As the asin function is very nearly linear near 0, this means that these measurements
are extremely close to being direct measurements of the phase.
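
The homodyne estimate of Eq. (3.215) can be illustrated with a short Monte Carlo sketch. It assumes that, for a coherent state, the measured quadrature is Gaussian with mean $2|\alpha|\sin\varphi$ and unit variance (the vacuum variance of $X_\Phi$ in the convention of Eq. (3.213)); the function names, the amplitude, the phase and the sample size are illustrative choices only.

```python
import math
import random


def homodyne_phase_estimates(alpha_abs, phase, samples, seed=0):
    """Draw quadrature results and convert each one to a phase estimate."""
    rng = random.Random(seed)
    mean = 2.0 * alpha_abs * math.sin(phase)        # Eq. (3.214)
    estimates = []
    for _ in range(samples):
        x = rng.gauss(mean, 1.0)                    # assumed unit quadrature variance
        ratio = max(-1.0, min(1.0, x / (2.0 * alpha_abs)))  # keep asin defined
        estimates.append(math.asin(ratio))          # Eq. (3.215)
    return estimates


def mean_and_variance(values):
    m = sum(values) / len(values)
    return m, sum((x - m) ** 2 for x in values) / len(values)


alpha_abs, phase = 10.0, 0.05                       # illustrative values
estimates = homodyne_phase_estimates(alpha_abs, phase, 200_000)
mean_est, var_est = mean_and_variance(estimates)
var_intrinsic = 1.0 / (4.0 * alpha_abs ** 2)
print(mean_est, var_est, var_intrinsic)
```

For a system phase close to zero the sample variance should sit close to the intrinsic value $1/(4|\alpha|^2)$, which is the sense in which the measurement is nearly a direct phase measurement.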



3.8.1 Fitted Phase Estimates 

For the case of adaptive phase measurements, we can consider phase estimates that are based on
fitting the phase to the data. The introduced phase uncertainty can then be estimated using the
techniques of data analysis. If we consider the photocurrent for a coherent state over a small but
finite time interval $\delta v$, we have

\[ I(v)\delta v = 2|\alpha|\cos(\varphi - \Phi)\delta v + \delta W(v), \tag{3.216} \]

where $\varphi$ is the system phase, and $\Phi$ is the local oscillator phase. This is equivalent to a series of
data points $y_i = I(v_i)\delta v$, where

\[ \langle y_i \rangle = 2|\alpha|\cos(\varphi - \Phi)\delta v. \tag{3.217} \]

If the values of $|\alpha|$ and $\varphi$ are assumed to be $|\alpha_f|$ and $\phi$ respectively, then the expectation values for
the data points are

\[ \hat{y}_i = 2|\alpha_f|\cos(\phi - \Phi)\delta v. \tag{3.218} \]

I will firstly consider the case where both the magnitude and phase of $\alpha$ are fitted for, then the
case where $|\alpha|$ is known, and only the phase needs to be fitted. To perform the fit, we wish to
minimise

\[ M(\alpha_f) = \sum_i (y_i - \hat{y}_i)^2. \tag{3.219} \]



Expanding this gives

\[ M(\alpha_f) = \sum_i \left[ (I(v_i)\delta v)^2 - 4I(v_i)|\alpha_f|\cos(\phi - \Phi)\delta v^2 + 4|\alpha_f|^2\cos^2(\phi - \Phi)\delta v^2 \right]. \tag{3.220} \]

As the $(I(v_i)\delta v)^2$ term does not depend on the fitting values, it can be omitted. We can also remove
a constant factor of $2\delta v$. Then taking the limit of small $\delta v$ gives

\[ M'(\alpha_f) = \int \left[ -2I(v)|\alpha_f|\cos(\phi - \Phi) + 2|\alpha_f|^2\cos^2(\phi - \Phi) \right] dv. \tag{3.221} \]



This can be simplified to an expression in terms of $A$ and $B$:

\[ M'(\alpha_f) = {\rm Re}\left[ -2A^*\alpha_f + |\alpha_f|^2 - B^*(\alpha_f)^2 \right]. \tag{3.222} \]

As this should be minimised for the fitted value of $\alpha$, the derivatives with respect to the magnitude
and phase of $\alpha_f$ should be zero. Thus we have

\[ \frac{\partial}{\partial\phi} M'(\alpha_f) = {\rm Im}\left[ -2A^*\alpha_f - 2B^*(\alpha_f)^2 \right] = 0, \tag{3.223} \]

and

\[ \frac{\partial}{\partial|\alpha_f|} M'(\alpha_f) = {\rm Re}\left[ -2A^* e^{i\phi} + 2|\alpha_f| - 2B^*|\alpha_f| e^{2i\phi} \right] = 0. \tag{3.224} \]

The solution of these equations is given by

\[ \alpha_f = \frac{A + BA^*}{1 - |B|^2}. \tag{3.225} \]

Thus we find that, if the coherent amplitude is unknown, fitting gives exactly the same phase
estimate $\arg(A + BA^*)$ as has been used previously.
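
The claim that Eq. (3.225) minimises Eq. (3.222) can be checked numerically. The sketch below evaluates $M'$ at the fitted amplitude and at randomly perturbed amplitudes; the values of `A` and `B` are arbitrary illustrative numbers with $|B| < 1$, and the helper names are hypothetical.

```python
import random


def m_prime(alpha_f, A, B):
    """Fitting function of Eq. (3.222)."""
    return (-2 * A.conjugate() * alpha_f
            + abs(alpha_f) ** 2
            - B.conjugate() * alpha_f ** 2).real


def fitted_amplitude(A, B):
    """Closed-form minimiser of Eq. (3.225)."""
    return (A + B * A.conjugate()) / (1 - abs(B) ** 2)


A, B = 3.1 + 0.8j, -0.35 + 0.15j        # illustrative values only
alpha_f = fitted_amplitude(A, B)

rng = random.Random(1)
base = m_prime(alpha_f, A, B)
# Smallest increase in M' over 1000 random perturbations of alpha_f.
worst_gap = min(
    m_prime(alpha_f + complex(rng.uniform(-1, 1), rng.uniform(-1, 1)), A, B) - base
    for _ in range(1000)
)
print(alpha_f, worst_gap)
```

Because $M'$ is quadratic in $\alpha_f$, a perturbation $\delta$ raises it by $|\delta|^2 - {\rm Re}(B^*\delta^2) \ge |\delta|^2(1 - |B|)$, so every perturbed value lies above the fitted one whenever $|B| < 1$.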

In order to find the uncertainty in the fitted values, we calculate the matrix $C$, where

\[ C_{i1} = \frac{\partial \hat{y}_i}{\partial\phi}, \tag{3.226} \]

\[ C_{i2} = \frac{\partial \hat{y}_i}{\partial|\alpha_f|}. \tag{3.227} \]

The covariance matrix is then given by

\[ \sigma^2 M_C^{-1}, \tag{3.228} \]

where $M_C = C^T C$, and $\sigma^2$ is the variance in the individual data points (which is $\delta v$ in this case).
Evaluating $M_C$ gives

\[ M_C = \begin{pmatrix} \sum 4|\alpha|^2\sin^2(\varphi - \Phi)\delta v^2 & -\sum 4|\alpha|\sin(\varphi - \Phi)\cos(\varphi - \Phi)\delta v^2 \\ -\sum 4|\alpha|\sin(\varphi - \Phi)\cos(\varphi - \Phi)\delta v^2 & \sum 4\cos^2(\varphi - \Phi)\delta v^2 \end{pmatrix}. \tag{3.229} \]

In data analysis we would normally evaluate this using the fitted value of $\alpha$ rather than the actual
value, as the actual value is unknown, and we wish to estimate the uncertainty in the fitted value.
Here, however, we are interested in the variance in the phase estimates for a given value of $\alpha$, and
it is therefore more useful to express the result in terms of the actual value of $\alpha$.
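Taking the inverse to find the variance in the fitted phase, we find

\[ {\rm var}(\phi) = \frac{\sum 4\cos^2(\varphi - \Phi)\delta v}{\left[ \sum 4|\alpha|^2\sin^2(\varphi - \Phi)\delta v \right]\left[ \sum 4\cos^2(\varphi - \Phi)\delta v \right] - \left[ \sum 4|\alpha|\sin(\varphi - \Phi)\cos(\varphi - \Phi)\delta v \right]^2}. \tag{3.230} \]

Taking the limit of small $\delta v$, this becomes

\[ {\rm var}(\phi) = \frac{\int\cos^2(\varphi - \Phi)dv}{4|\alpha|^2\left\{ \left[ \int\sin^2(\varphi - \Phi)dv \right]\left[ \int\cos^2(\varphi - \Phi)dv \right] - \left[ \int\sin(\varphi - \Phi)\cos(\varphi - \Phi)dv \right]^2 \right\}}. \tag{3.231} \]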



From this expression we can see that smaller phase variances will be obtained if $\sin(\varphi - \Phi)\cos(\varphi - \Phi)$
has a time averaged value close to zero. This is possible because $\sin(\varphi - \Phi)\cos(\varphi - \Phi)$ can take
negative values. We will also obtain small phase variances if the time averaged value of $\sin^2(\varphi - \Phi)$
is close to 1. This means that the local oscillator phase should also be close to $\varphi + \pi/2$. Note that if
we are using a local oscillator phase of $\hat\varphi + \pi/2$, using a better phase estimate $\hat\varphi$ does not necessarily
result in a smaller variance, as we also want $\sin(\varphi - \Phi)\cos(\varphi - \Phi)$ to average to zero. This gives an
alternative explanation of the result found before that using the best intermediate phase estimate
does not result in the smallest phase variance.
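
The matrix calculation of Eqs. (3.226) to (3.231) can be reproduced numerically for any prescribed local oscillator phase. The sketch below builds the columns of $C$ on a discrete time grid, inverts the $2 \times 2$ matrix $M_C$, and compares the result with the closed forms. The local oscillator schedule `lo_phase` is a hypothetical smooth function chosen only for illustration (it is not an adaptive scheme), and the phase-only variance uses the single-parameter result of Eq. (3.238).

```python
import math


def fit_variances(alpha_abs, phase, lo_phase, steps=2000):
    """Phase variances from the matrix C, for two-parameter and phase-only fits."""
    dv = 1.0 / steps
    m11 = m12 = m22 = 0.0
    for k in range(steps):
        d = phase - lo_phase((k + 0.5) * dv)
        c_phi = -2.0 * alpha_abs * math.sin(d) * dv   # d y_i / d phi, Eq. (3.226)
        c_amp = 2.0 * math.cos(d) * dv                # d y_i / d |alpha_f|, Eq. (3.227)
        m11 += c_phi * c_phi
        m12 += c_phi * c_amp
        m22 += c_amp * c_amp
    sigma2 = dv                                       # variance of each data point
    var_both = sigma2 * m22 / (m11 * m22 - m12 * m12) # (1,1) element of sigma^2 M_C^-1
    var_phase_only = sigma2 / m11
    return var_both, var_phase_only


def closed_forms(alpha_abs, phase, lo_phase, steps=2000):
    """Eqs. (3.231) and (3.238) with the integrals taken by the midpoint rule."""
    dv = 1.0 / steps
    i_s = i_c = i_sc = 0.0
    for k in range(steps):
        d = phase - lo_phase((k + 0.5) * dv)
        i_s += math.sin(d) ** 2 * dv
        i_c += math.cos(d) ** 2 * dv
        i_sc += math.sin(d) * math.cos(d) * dv
    both = i_c / (4.0 * alpha_abs ** 2 * (i_s * i_c - i_sc ** 2))
    phase_only = 1.0 / (4.0 * alpha_abs ** 2 * i_s)
    return both, phase_only


alpha_abs, phase = 3.0, 0.3                           # illustrative values


def lo_phase(v):
    # Hypothetical schedule: near phase + pi/2 with a slow wobble.
    return phase + math.pi / 2 + 0.4 * math.sin(5.0 * v)


var_both, var_phase_only = fit_variances(alpha_abs, phase, lo_phase)
cf_both, cf_phase_only = closed_forms(alpha_abs, phase, lo_phase)
print(var_both, var_phase_only, 1.0 / (4.0 * alpha_abs ** 2))
```

By the Cauchy-Schwarz inequality the two-parameter variance can never be smaller than the phase-only variance, which is one way of seeing why knowing $|\alpha|$ helps.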

In order to obtain better phase estimates, we can consider the case where $|\alpha|$ is known, and only
the phase is fitted for. Then it is easy to see that we obtain Eq. (3.222), except the actual value of
$|\alpha|$ is used:

\[ M'(\phi) = {\rm Re}\left[ -2A^*|\alpha| e^{i\phi} + |\alpha|^2 - B^*|\alpha|^2 e^{2i\phi} \right]. \tag{3.232} \]

As $|\alpha|^2$ does not depend on the fitted phase, it can be omitted, so this simplifies to

\[ M'(\phi) = -{\rm Re}\left[ 2A^*|\alpha| e^{i\phi} + B^*|\alpha|^2 e^{2i\phi} \right]. \tag{3.233} \]

Note that this is equivalent to the result obtained in Eq. (21) of Ref. [?]. To have a minimum, we
require

\[ \frac{\partial}{\partial\phi} M'(\phi) = -{\rm Im}\left[ 2A^*|\alpha| e^{i\phi} + 2B^*|\alpha|^2 e^{2i\phi} \right] = 0. \tag{3.234} \]



Unlike the previous case there is no simple solution in terms of $A$ and $B$. The solution to this can
be found by finding the roots of the fourth order polynomial

\[ \begin{aligned} P(\cos\phi) = {} & 4|\alpha|^2|B|^2\cos^4\phi + 4|\alpha|\,{\rm Re}(AB^*)\cos^3\phi + (|A|^2 - 4|\alpha|^2|B|^2)\cos^2\phi \\ & - 2|\alpha|({\rm Im}A\,{\rm Im}B + 2{\rm Re}A\,{\rm Re}B)\cos\phi + |\alpha|^2({\rm Im}B)^2 - ({\rm Re}A)^2. \end{aligned} \tag{3.235} \]
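
As a consistency check on Eq. (3.235), the following sketch minimises $M'(\phi)$ of Eq. (3.233) directly (coarse grid followed by bisection on the stationarity condition) and then confirms that $\cos\phi$ at the minimum is a root of the quartic. This is a stand-in for solving the polynomial, not the method used for the results in this chapter, and the values of `A`, `B` and `alpha_abs` are illustrative only.

```python
import cmath
import math


def m_prime(phi, A, B, alpha_abs):
    """Eq. (3.233): M'(phi) = -Re[2 A* |alpha| e^{i phi} + B* |alpha|^2 e^{2 i phi}]."""
    e = cmath.exp(1j * phi)
    return -(2 * A.conjugate() * alpha_abs * e
             + B.conjugate() * alpha_abs ** 2 * e * e).real


def stationarity(phi, A, B, alpha_abs):
    """Im[A* e^{i phi} + B* |alpha| e^{2 i phi}]; this vanishes when Eq. (3.234) holds."""
    e = cmath.exp(1j * phi)
    return (A.conjugate() * e + B.conjugate() * alpha_abs * e * e).imag


def quartic(c, A, B, alpha_abs):
    """The polynomial of Eq. (3.235) evaluated at c = cos(phi)."""
    r = alpha_abs
    return (4 * r ** 2 * abs(B) ** 2 * c ** 4
            + 4 * r * (A * B.conjugate()).real * c ** 3
            + (abs(A) ** 2 - 4 * r ** 2 * abs(B) ** 2) * c ** 2
            - 2 * r * (A.imag * B.imag + 2 * A.real * B.real) * c
            + r ** 2 * B.imag ** 2 - A.real ** 2)


def fitted_phase(A, B, alpha_abs, grid=720):
    """Global minimiser of M'(phi): coarse grid, then bisection refinement."""
    step = 2 * math.pi / grid
    trials = [-math.pi + k * step for k in range(grid)]
    best = min(trials, key=lambda p: m_prime(p, A, B, alpha_abs))
    lo, hi = best - step, best + step
    # For M' as defined above, the stationarity function is negative just
    # below a minimum and positive just above it.
    if not (stationarity(lo, A, B, alpha_abs) < 0.0 < stationarity(hi, A, B, alpha_abs)):
        return best                      # fall back to the grid value
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if stationarity(mid, A, B, alpha_abs) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


A, B, alpha_abs = 4.6 + 1.3j, -0.2 + 0.1j, 5.0   # illustrative values only
phi_fit = fitted_phase(A, B, alpha_abs)
print(phi_fit, quartic(math.cos(phi_fit), A, B, alpha_abs))
```

The quartic arises from squaring the stationarity condition, so it also has roots that do not correspond to the minimum; whichever way the roots are found, the one giving the smallest $M'(\phi)$ is the one wanted.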



The variance in the phase estimate, on the other hand, can be found far more easily. We simply
have

\[ M_C = \sum 4|\alpha|^2\sin^2(\varphi - \Phi)\delta v^2, \tag{3.236} \]

so the variance in the fitted phase is

\[ {\rm var}(\phi) = \frac{1}{\sum 4|\alpha|^2\sin^2(\varphi - \Phi)\delta v}. \tag{3.237} \]

In the limit of small $\delta v$ this becomes

\[ {\rm var}(\phi) = \frac{1}{4|\alpha|^2\int\sin^2(\varphi - \Phi)dv}. \tag{3.238} \]

This is much simpler than the case where both the amplitude and the phase were fitted. The smallest
phase variance can be obtained by using local oscillator phases as close as possible to $\varphi + \pi/2$, and
there are no extra terms to complicate the problem.
It is clear that if an accurate estimate of the phase is known beforehand, it is possible to obtain
a phase variance that is extremely close to the intrinsic phase variance of $1/(4|\alpha|^2)$. If the local
oscillator phase is $\Phi = \hat\varphi + \pi/2$, then the expression for the variance becomes

\[ {\rm var}(\phi) = \frac{1}{4|\alpha|^2\int\cos^2(\hat\varphi - \varphi)dv} \approx \frac{1}{4|\alpha|^2} + \frac{\int(\hat\varphi - \varphi)^2 dv}{4|\alpha|^2}. \tag{3.239} \]

This indicates that the introduced phase variance is proportional to the variance in the intermediate
phase estimates, so it should be possible to reduce the introduced phase variance practically
indefinitely by using better intermediate phase estimates. In particular, it should be possible to reduce
it below the theoretical limit based on $\arg(A + BA^*)$ as the final phase estimate.

If the intermediate phase estimate is based on the measurement results so far, then the variance
in the phase estimate cannot be smaller than the canonical phase variance for a coherent state with
$|\alpha|^2 v$ photons, $1/(4|\alpha|^2 v)$. We cannot use this directly in the above equation, as this variance goes
to infinity for zero time. Since the average value of $\cos^2$ for a randomised phase is 1/2, it is only
reasonable to use the approximation

\[ \langle\cos^2(\hat\varphi - \varphi)\rangle \approx 1 - \langle(\hat\varphi - \varphi)^2\rangle \tag{3.240} \]

when the variance $\langle(\hat\varphi - \varphi)^2\rangle$ is less than 1/2. In order to obtain an approximate result, we can use this
approximation for $v > 1/(2|\alpha|^2)$, and use 1/2 for $v < 1/(2|\alpha|^2)$. Then we find

\[ \int\langle\cos^2(\hat\varphi - \varphi)\rangle dv \approx \int_0^{1/(2|\alpha|^2)}\frac{1}{2}dv + \int_{1/(2|\alpha|^2)}^1\left( 1 - \frac{1}{4|\alpha|^2 v} \right)dv = 1 - \frac{1 + \log 2|\alpha|^2}{4|\alpha|^2}. \tag{3.241} \]

Therefore the phase variance should be approximately

\[ {\rm var}(\phi) \approx \frac{1}{4|\alpha|^2} + \frac{1 + \log 2|\alpha|^2}{16|\alpha|^4}. \tag{3.242} \]

Thus we find that the introduced phase variance scales roughly as $\log n/(16n^2)$. This is less than
the theoretical limit, which scales as $\log n/(4n^2)$.
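
The piecewise approximation behind Eq. (3.241) is simple enough to check by direct quadrature. The sketch below integrates the approximate integrand (1/2 before $v = 1/(2n)$, and $1 - 1/(4nv)$ afterwards, with $n = |\alpha|^2$) by the midpoint rule and compares it with the closed form; the photon number and the step count are illustrative choices.

```python
import math


def mean_cos2_integral(n, steps=200_000):
    """Midpoint-rule integral of the piecewise approximation to <cos^2>."""
    v0 = 1.0 / (2.0 * n)                  # crossover time between the two regimes
    early = 0.5 * v0                      # integrand is 1/2 for v < v0
    dv = (1.0 - v0) / steps
    late = sum(1.0 - 1.0 / (4.0 * n * (v0 + (k + 0.5) * dv)) for k in range(steps)) * dv
    return early + late


def closed_form(n):
    """Right-hand side of Eq. (3.241) with n = |alpha|^2."""
    return 1.0 - (1.0 + math.log(2.0 * n)) / (4.0 * n)


n = 50.0                                  # illustrative mean photon number
numeric = mean_cos2_integral(n)
exact = closed_form(n)

# Excess variance over 1/(4n), scaled by 16 n^2, using var = 1/(4 n * integral).
excess_scaled = 16.0 * n * n * (1.0 / (4.0 * n * exact) - 1.0 / (4.0 * n))
print(numeric, exact, excess_scaled, 1.0 + math.log(2.0 * n))
```

The scaled excess differs from $1 + \log 2n$ only through the higher order terms dropped in going from Eq. (3.239) to Eq. (3.242).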

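To check the scaling when the calculation is performed exactly, we can use

\[ \langle\cos^2\hat\varphi\rangle = \frac{1}{2}\left( 1 + {\rm Re}\langle e^{2i\hat\varphi}\rangle \right). \tag{3.243} \]

Here the system phase has been taken to be zero for simplicity. For a coherent state with amplitude
$\alpha\sqrt{v}$, we have

\[ \langle n|\alpha\sqrt{v}\rangle = e^{-\alpha^2 v/2}\frac{(\alpha\sqrt{v})^n}{\sqrt{n!}}. \tag{3.244} \]

This means that

\[ \langle e^{2i\hat\varphi}\rangle = e^{-\alpha^2 v}\sum_{n=0}^{\infty}\frac{(\alpha^2 v)^{n+1}}{\sqrt{n!(n+2)!}}. \tag{3.245} \]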
Figure 3.25: The excess phase variance multiplied by $16n^2$ for coherent states calculated using
Eq. (3.246). The calculated result is shown as the continuous line, and the dotted line is the fitted
expression (3.247).

Now taking the integral of this over time gives

\[ \int_0^1 e^{-\alpha^2 v}\sum_{n=0}^{\infty}\frac{(\alpha^2 v)^{n+1}}{\sqrt{n!(n+2)!}}dv = \sum_{n=0}^{\infty}\frac{1}{\alpha^2\sqrt{n!(n+2)!}}\int_0^{\alpha^2} e^{-t}t^{n+1}dt = \frac{1}{\alpha^2}\sum_{n=0}^{\infty}\frac{\Gamma(\alpha^2, n+2)}{\sqrt{n!(n+2)!}}, \tag{3.246} \]

where $\Gamma(\alpha^2, n+2)$ is the incomplete gamma function. This can then be used to evaluate $\langle\cos^2\hat\varphi\rangle$
exactly. The results calculated in this way are plotted in Fig. 3.25.
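
A rough numerical counterpart of this calculation is sketched below. Rather than using incomplete gamma functions, it sums the series of Eq. (3.245) term by term (written in terms of Poisson weights to avoid overflow) and integrates over time with Simpson's rule; this substitution, the function names and the photon number are assumptions made for illustration. The excess is taken to first order, as in Eq. (3.239), so that $16n^2$ times the introduced variance is $4n(1 - \int\langle\cos^2\hat\varphi\rangle dv)$.

```python
import math


def mean_exp_2i_phase(mu, tol=1e-16):
    """Series of Eq. (3.245) with mu = alpha^2 v, using Poisson weights p_n."""
    if mu == 0.0:
        return 0.0
    p = math.exp(-mu)                     # p_0 = e^{-mu}
    total = 0.0
    n = 0
    while True:
        term = p * mu / math.sqrt((n + 1) * (n + 2))
        total += term
        n += 1
        p *= mu / n                       # p_n = p_{n-1} * mu / n
        if n > mu and term < tol:         # past the peak and negligible
            return total


def integral_mean_cos2(n_mean, steps=1000):
    """Simpson's rule for the time integral of <cos^2>, via Eq. (3.243)."""
    h = 1.0 / steps                       # steps must be even

    def f(v):
        return 0.5 * (1.0 + mean_exp_2i_phase(n_mean * v))

    s = f(0.0) + f(1.0)
    for k in range(1, steps):
        s += (4.0 if k % 2 else 2.0) * f(k * h)
    return s * h / 3.0


n_mean = 100.0                            # illustrative mean photon number
integral = integral_mean_cos2(n_mean)
scaled_excess = 4.0 * n_mean * (1.0 - integral)
print(scaled_excess, math.log(n_mean))
```

Subtracting $\log n$ from the printed value gives the constant that plays the role of the additive term in the fitted expression.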

It was found that the scaling of the introduced phase variance was again $\log n/(16n^2)$ when
estimated using this more exact method. There was a difference of order $n^{-2}$ with the previous
result, however. It was found that the introduced phase variance was very close to

\[ \frac{\log n + 1.7733}{16n^2}, \tag{3.247} \]

as compared to

\[ \frac{\log n + 1.693147}{16n^2} \tag{3.248} \]

using the very approximate method of Eq. (3.241).



Note that this result is still based on assuming that the intermediate phase estimates are as
good as canonical. To determine the result for the real case, where the intermediate phase estimates
are based on the preceding data, we must perform stochastic integrals to determine the result
numerically. To determine the results in this case, rather than performing the integrals separately
for each mean photon number, a continuous calculation with an exponentially increasing time-step
was used. Then at various times, the photon number up to that time and the integral of $\langle\cos^2\hat\varphi\rangle$
were determined.

Figure 3.26: The excess phase variance multiplied by $16n^2$ for coherent states as determined using
stochastic integrals. The calculated result is shown as the continuous line, and the dotted line is the
fitted expression.

It is possible to do this in the case of coherent states, because the integral up to time $v$ with
coherent amplitude $\alpha$ is equivalent to the integral over the unit interval with amplitude $\alpha\sqrt{v}$. Note
also that the integral of $\langle\cos^2\hat\varphi\rangle$ does not need to be calculated separately, because

\[ {\rm Re}B_v = \int_0^v\cos 2\hat\varphi_u\,du. \tag{3.249} \]

For this calculation the intermediate phase estimate was the best phase estimate using the known
value of $|\alpha|$, as found by solving Eq. (3.235).



This calculation was performed for $2^{14}$ samples, and the results are shown in Fig. 3.26. The
results again scaled as $\log n/(16n^2)$, but the term of order $n^{-2}$ was different. The scaling obtained
was approximately

\[ \frac{\log n + 2.4}{16n^2}. \tag{3.250} \]

As can be expected, this is slightly worse than the result calculated assuming that the intermediate
phase estimates are as good as canonical.

This demonstrates that if the coherent amplitude is known, it is possible to reduce the introduced
phase variance to a factor of 4 below the theoretical limit (that scales as $\log n/(4n^2)$). Unfortunately
this is not very useful in the case of coherent states, as the introduced phase variance is always (for
an adaptive feedback scheme) far less than the intrinsic phase variance.

If the introduced phase variance remained this small for a squeezed state, then this would be a
very significant result. Unfortunately, the result that the introduced phase variance should be fairly
independent of the input state breaks down for this type of phase estimate, for similar reasons to
why the theoretical limit does not apply.

Figure 3.27: The phase variances using $\arg C_v$ feedback on optimally squeezed input states. The
crosses are the results for $\arg C$ phase estimates, the pluses are those found by solving Eq. (3.235),
and the circles are those for the phase estimates based on numerically minimising Eq. (3.255). The
continuous line is the theoretical limit, and the dotted line is the variance for mark II measurements.

The best phase estimate as found by solving Eq. (3.235) is specific to coherent states, and the
best phase estimate for other states will be different. For the case of squeezed states, this is a difficult
calculation where the evolution of the state for each input phase must be determined based on the
measured values of $I(v)$. Specifically, in the discretised calculation we are using

\[ I(v)\delta v = 2{\rm Re}(\alpha_v e^{-i\Phi(v)})\delta v + \delta W(v). \tag{3.251} \]

For some assumed system phase, we can determine what value $\delta W(v)$ would have using

\[ \delta W'(v) = I(v)\delta v - 2{\rm Re}(\alpha'_v e^{-i\Phi(v)})\delta v, \tag{3.252} \]

where the primes indicate that these are the values determined based on that assumed system phase
(as opposed to the actual phase). For this value of $\delta W'(v)$, the evolution of $\alpha'_v$ is then



5a' = 



1 



B'lsW'{v) 



where 



v 1- \B' 



B' 



(i?' s )-i - B* ' 

After these values are calculated over the entire time interval [0, 1), we then determine

\[ \sum (\delta W'(v))^2. \]



(3.253) 



(3.254) 



(3.255) 



We wish to minimise this in order to find the phase that gives the best fit to the data. 
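
The residual-sum fit can be sketched for the simplest case of a coherent state, where no state evolution is needed. The sketch assumes the amplitude stays fixed, $\alpha'_v = |\alpha|e^{i\varphi'}$, so that Eq. (3.252) reduces to the form of Eq. (3.216), and it uses a fixed, rotating local oscillator phase rather than adaptive feedback. The schedule, the parameter values and the grid search are all illustrative assumptions; for squeezed states the evolution of $\alpha'_v$ would have to be recomputed for every trial phase.

```python
import cmath
import math
import random


def simulate_record(alpha_abs, phase, lo_phases, dv, rng):
    """Photocurrent increments I(v) dv for a coherent state, as in Eq. (3.216)."""
    return [2.0 * alpha_abs * math.cos(phase - lo) * dv + rng.gauss(0.0, math.sqrt(dv))
            for lo in lo_phases]


def residual_sum(trial_phase, record, alpha_abs, lo_phases, dv):
    """Sum of (delta W')^2 for an assumed system phase, Eqs. (3.252) and (3.255)."""
    return sum((y - 2.0 * alpha_abs * math.cos(trial_phase - lo) * dv) ** 2
               for y, lo in zip(record, lo_phases))


def fitted_phase(record, alpha_abs, lo_phases, dv, grid=720):
    """Trial phase on a uniform grid that minimises the residual sum."""
    trials = [-math.pi + 2.0 * math.pi * k / grid for k in range(grid)]
    return min(trials, key=lambda p: residual_sum(p, record, alpha_abs, lo_phases, dv))


steps = 500
dv = 1.0 / steps
alpha_abs, true_phase = 5.0, 0.7                     # illustrative values
# Hypothetical non-adaptive schedule: the LO phase rotates 8 times over [0, 1).
lo_phases = [2.0 * math.pi * 8.0 * (k + 0.5) * dv for k in range(steps)]

rng = random.Random(2)
record = simulate_record(alpha_abs, true_phase, lo_phases, dv, rng)
estimate = fitted_phase(record, alpha_abs, lo_phases, dv)
error = cmath.phase(cmath.exp(1j * (estimate - true_phase)))   # wrapped to (-pi, pi]
print(estimate, error)
```

For this rotating schedule Eq. (3.238) suggests a standard deviation of roughly $1/\sqrt{2|\alpha|^2}$ for the fitted phase, so individual errors of about 0.1 rad are typical here.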

It is feasible to use this method for the final phase estimates; however, it would make the
calculation far too difficult if it was also used for the intermediate phase estimates. Therefore, for
the intermediate phase estimates $\arg C_v$ was used initially. The results of using this intermediate
phase estimate on optimally squeezed states are plotted in Fig. 3.27.

If the $\arg C$ phase estimate is used at the end of the measurement as well, the phase variance
is far greater than the theoretical limit. The phase estimate based on minimising (3.255) gives a
much reduced variance, but it is still far above the theoretical limit. In fact, it is still worse than
the phase variance for mark II measurements. Also shown in Fig. 3.27 is the phase estimate based
on coherent states, found by solving Eq. (3.235). These phase estimates give variances that are very
close to, but slightly above, the variances for the fitted phases.

Figure 3.28: The phase variances for measurements on optimally squeezed input states using phase
estimates in the feedback found by solving Eq. (3.235). The crosses are the results for $\arg C$ phase
estimates, the pluses are those found by solving Eq. (3.235), and the circles are those for the phase
estimates based on numerically minimising Eq. (3.255). The continuous line is the theoretical limit,
and the dotted line is the variance for mark II measurements.

As an alternative feedback scheme, we can use the phase estimates based on coherent states in
the feedback. The results using this feedback are shown in Fig. 3.28. The variances for the fitted
phases are slightly less than those using $\arg C_v$ intermediate phase estimates, but are still far above
the theoretical limit or even the variances for mark II measurements.

It is possible to obtain phase variances close to the theoretical limit if we use feedback that gives
results close to the theoretical limit for the usual $\arg C$ phase estimates. For example, if we use
the corrected feedback as in Sec. 3.7.3, we obtain the results shown in Fig. 3.29. As the numerical
results do not take account of the low probability results with large errors, a correction factor of
$1/(4n^2)$ has been added to these results (as was done for Fig. 3.24).

In practice it was found that, for the smaller photon numbers where large error results were
obtained, the fitted phases were still close to the $\arg C$ phase estimates, so large errors were obtained
for the same samples for both phase estimates. This can also be ensured by only performing the
minimisation near $\arg C$. Then the contribution due to the large error results will be the same for
both cases. This is why the same correction factor of $1/(4n^2)$ has been added to both sets of results
in Fig. 3.29.

It is seen in Fig. 3.29 that when the corrections are taken into account, the results for the usual
phase estimate are slightly greater than the theoretical limit, but the results for the fitted phase are
slightly less than the theoretical limit. This demonstrates that it is possible to surpass the theoretical
limit in the case of squeezed states, but only by a very small margin.



3.8.2 Optimal Phase Estimates 

The fitting approach considered in the previous section gives the most probable phase, but this is
not necessarily the same as the phase estimate that minimises the variance. It is shown in Sec. 6.2
that the phase estimate that minimises the phase variance for interferometric measurements is

\[ \hat\varphi = \arg\int_{-\pi}^{\pi} e^{i\varphi}P(n_m|\varphi)\,d\varphi, \tag{3.256} \]

where $n_m$ is the measurement record. This derivation is very general, and should also hold for dyne
measurements, where there is a continuous measurement record $I_{[0,v)}$.

Figure 3.29: The phase variances for measurements on optimally squeezed input states using
intermediate phase estimates as in Sec. 3.7.3. The continuous line is the phase variance for $\arg C$ phase
estimates and the dash-dotted line is the variance for phase estimates based on fitting. All variances
are shown as a ratio to the theoretical limit.

Similarly to the case for interferometry, we have [using Eq. (1.72)]

\[ P(I_{[0,v)}|\varphi) \propto P(\varphi|I_{[0,v)}), \tag{3.257} \]

so the optimum phase estimate can be expressed as

\[ \hat\varphi = \arg\int_{-\pi}^{\pi} e^{i\varphi}P(\varphi|I_{[0,v)})\,d\varphi = \arg\langle e^{i\varphi}\rangle, \tag{3.258} \]

where the average is over the probability distribution for the phase based on the measurement record.
In order to determine the probability distribution $P(I_{[0,v)}|\varphi)$, we need to determine the probability
of obtaining the increments $dW(v)$. Considering the discretised equation, each increment $\delta W(v)$ has
a Gaussian distribution with variance $\delta v$. The probability of obtaining $\delta W(v)$ is therefore

\[ P(\delta W(v)) \propto e^{-(\delta W(v))^2/(2\delta v)}. \tag{3.259} \]

The probability of obtaining the measurement record will therefore be

\[ P(I_{[0,v)}|\varphi) \propto \exp\left( -\sum(\delta W(v))^2/(2\delta v) \right). \tag{3.260} \]
From Eq. (3.257), this means that we also have

\[ P(\varphi|I_{[0,v)}) \propto \exp\left( -\sum(\delta W'(v))^2/(2\delta v) \right). \tag{3.261} \]

Here the prime on $\delta W'(v)$ indicates that it is calculated from the measurement record based on an
assumed system phase, similarly to the previous section.

It is obvious that the fitted phase discussed in the previous section is the most probable phase,
as it minimises $\sum(\delta W'(v))^2$ and therefore maximises $P(\varphi|I_{[0,v)})$. To find the optimal phase that
minimises the final phase uncertainty, we need to calculate

\[ \hat\varphi = \arg\int_{-\pi}^{\pi} e^{i\varphi}P(\varphi|I_{[0,v)})\,d\varphi = \arg\int_{-\pi}^{\pi} e^{i\varphi}\exp\left( -\sum(\delta W'(v))^2/(2\delta v) \right)d\varphi. \tag{3.262} \]

For a coherent state this simplifies to

\[ \hat\varphi = \arg\int_{-\pi}^{\pi} e^{i\phi}\exp\left( {\rm Re}\left[ 2A^*|\alpha| e^{i\phi} + B^*|\alpha|^2 e^{2i\phi} \right] \right)d\phi. \tag{3.263} \]

It does not appear to be possible to evaluate this integral analytically, and it would need to be
evaluated numerically.
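
For coherent states the numerical evaluation is straightforward, since the integrand of Eq. (3.263) depends only on $A$, $B$ and $|\alpha|$. The sketch below evaluates the integral on a uniform grid and compares the resulting optimal estimate with the most probable (fitted) phase from the same grid. The values of `A`, `B` and `alpha_abs` are illustrative, and the maximum exponent is subtracted before exponentiating purely to avoid overflow (this does not change the argument of the integral).

```python
import cmath
import math


def log_weight(phi, A, B, alpha_abs):
    """Exponent in Eq. (3.263): Re[2 A* |alpha| e^{i phi} + B* |alpha|^2 e^{2 i phi}]."""
    e = cmath.exp(1j * phi)
    return (2 * A.conjugate() * alpha_abs * e
            + B.conjugate() * alpha_abs ** 2 * e * e).real


def optimal_and_fitted_phase(A, B, alpha_abs, grid=4000):
    """Return (arg of the weighted mean of e^{i phi}, most probable phi on the grid)."""
    phis = [-math.pi + 2.0 * math.pi * (k + 0.5) / grid for k in range(grid)]
    logs = [log_weight(p, A, B, alpha_abs) for p in phis]
    top = max(logs)
    moment = sum(cmath.exp(1j * p) * math.exp(l - top) for p, l in zip(phis, logs))
    optimal = cmath.phase(moment)
    fitted = phis[logs.index(top)]
    return optimal, fitted


A, B, alpha_abs = 4.6 + 1.3j, -0.2 + 0.1j, 5.0      # illustrative values only
optimal, fitted = optimal_and_fitted_phase(A, B, alpha_abs)
print(optimal, fitted)
```

The two estimates differ slightly because the distribution is not symmetric about its peak, which is the distinction between the most probable phase and the phase that minimises the variance.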

The calculation is even more difficult in the case of squeezed states. As was discussed in the
previous section, for squeezed states the entire calculation (i.e. determining the time evolution of the
state) must be repeated for each value of $\phi$. For numerical minimisation the number of values of $\phi$
for which the calculation must be performed is on the order of 10, which means that the calculations
are more lengthy, but not infeasible. For a numerical integral thousands of function evaluations
would be required for an accurate result, making this method infeasible. For this reason, these
optimal phase estimates were not used for dyne measurements in this study. As will be seen in
Ch. however, it is much easier to determine the optimal phase estimates in interferometry. 



Chapter 4 

The Effect of Time Delays 



4.1 Introduction 

In practice the adaptive dyne measurements described in the previous chapters cannot be performed 
exactly. In any experiment there will be imperfections, for example calibration errors. In making 
phase measurements a major source of error is inefficient photodetectors. This is particularly the 
case for single photon photodetectors. These are photodetectors that are designed for distinguishing 
between, for example 1 or 2 photons. These photodetectors currently cannot be made with efficiencies 
above about 87% @. 

High amplitude photodetectors, on the other hand, can be made with far higher efficiencies, 
around 98% E). This is the sort of photodetector required for dyne measurements, due to the large 
amplitude local oscillator field. It is fairly straightforward to determine the extra phase uncertainty 
due to inefficient photodetectors. In the introduction (and Ref. [55]), it is shown that when the
photodetector efficiency is $\eta$, the extra phase variance is approximately

\[ \frac{1 - \eta}{4\eta n}. \tag{4.1} \]

This extra phase variance means that, for large photon numbers, phase feedback schemes can
only reduce the phase variance by a factor of about $1 - \eta$. This will be true for mark II measurements,
as well as the more advanced phase feedback schemes described in the preceding chapter. For current
photodetectors, this extra phase variance is more significant than the introduced phase variance for
mark II measurements for photon numbers above about 1000. Below this the extra phase variance
for mark II measurements is only marginally above the theoretical limit, and so it is not possible to
greatly reduce the phase variance using more advanced feedback schemes.

Another imperfection that is specific to phase measurements with feedback is the time delay in
the feedback loop. This contribution is more difficult to estimate. Some highly simplified calculations
indicate that the excess phase variance due to time delays is $\tau/2$ for mark I measurements (where $\tau$
is the time delay), and $\tau/(2n)$ for mark II measurements [35].

In Ref. |H I repeated these derivations more rigorously, and this chapter is based on that 
paper. While the result for mark I measurements is reasonably accurate, the perturbation approach
is inadequate to obtain a consistent result for mark II measurements. In Sec. 4.2 an alternative
derivation is considered that gives the minimum phase variance when there is a time delay. The
phase variance with time delays is evaluated numerically in Sec. 4.4, and it is shown that for most
of the measurement schemes the phase variance approaches this limit for large time delays.



4.2 Perturbation Approach 
4.2.1 Mark I



Firstly I will estimate the effect of time delays on simplified mark I measurements in a similar way 
as in Ref. but using fewer of the simplifications used there. Without a time delay the stochastic 
differential equation for the phase estimate is 



\[ d\hat\varphi_v = \frac{I(v)\,dv}{\sqrt{v}} = v^{-1/2}\left[ -2\alpha\sin\hat\varphi_v\,dv + dW(v) \right]. \tag{4.2} \]

Here the input phase has been taken to be zero for simplicity. For some time $v_1$ the phase will come
to lie near 0, so we can linearise around $\hat\varphi_v = 0$. The result, which will be valid for $v_1 < v < 1$, is

\[ d\hat\varphi_v = v^{-1/2}\left[ -2\alpha\hat\varphi_v\,dv + dW(v) \right]. \tag{4.3} \]

We wish to consider the limit of large $\alpha$, so that $v_1$ is small, and this linearisation is accurate for
most of the measurement. Including the time delay the SDE is

\[ d\hat\varphi_v = v^{-1/2}\left[ -2\alpha\hat\varphi_{v-\tau}\,dv + dW(v) \right]. \tag{4.4} \]



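Now the time delay will be treated perturbatively. The solution to the perturbed equation can be
written as

\[ \hat\varphi_v = \hat\varphi_v^{(0)} + \alpha\tau\hat\varphi_v^{(1)} + O(\alpha^2\tau^2). \tag{4.5} \]

Note that for this approximation to be accurate, $\alpha\tau$ must be small, in addition to $\alpha$ being large.
The zeroth-order term obeys the SDE for no delay (4.3), so the first-order correction obeys

\[ \alpha\tau\,d\hat\varphi_v^{(1)} = 2\alpha v^{-1/2}\left( \hat\varphi_v^{(0)} - \hat\varphi_{v-\tau}^{(0)} \right)dv - 2\alpha^2\tau v^{-1/2}\hat\varphi_{v-\tau}^{(1)}dv. \tag{4.6} \]

Therefore to first order in $\tau$ we have

\[ \begin{aligned} d\hat\varphi_v^{(1)} &= 2v^{-1/2}d\hat\varphi_v^{(0)} - 2\alpha v^{-1/2}\hat\varphi_v^{(1)}dv \\ &= 2v^{-1/2}\left\{ v^{-1/2}\left[ -2\alpha\hat\varphi_v^{(0)}dv + dW(v) \right] \right\} - 2\alpha v^{-1/2}\hat\varphi_v^{(1)}dv \\ &= -2\alpha v^{-1/2}\hat\varphi_v^{(1)}dv - \frac{4\alpha}{v}\hat\varphi_v^{(0)}dv + \frac{2}{v}dW(v). \end{aligned} \tag{4.7} \]

It is straightforward to show that the solution to the zeroth order equation is

\[ \hat\varphi_v^{(0)} = e^{4\alpha(\sqrt{v_1} - \sqrt{v})}\hat\varphi_{v_1}^{(0)} + \int_{v_1}^{v}\frac{e^{4\alpha(\sqrt{u} - \sqrt{v})}}{\sqrt{u}}dW(u). \tag{4.8} \]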



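Using this in Eq. (4.7) and multiplying on both sides by $e^{4\alpha\sqrt{v}}$ gives

\[ d\left( e^{4\alpha\sqrt{v}}\hat\varphi_v^{(1)} \right) = -\frac{4\alpha}{v}\left[ e^{4\alpha\sqrt{v_1}}\hat\varphi_{v_1}^{(0)} + \int_{v_1}^{v}\frac{e^{4\alpha\sqrt{u}}}{\sqrt{u}}dW(u) \right]dv + \frac{2}{v}e^{4\alpha\sqrt{v}}dW(v). \tag{4.9} \]

Integrating then gives the solution

\[ \hat\varphi_v^{(1)} = e^{4\alpha(\sqrt{v_1} - \sqrt{v})}\hat\varphi_{v_1}^{(1)} - \int_{v_1}^{v}\frac{4\alpha}{s}\left[ e^{4\alpha(\sqrt{v_1} - \sqrt{v})}\hat\varphi_{v_1}^{(0)} + \int_{v_1}^{s}\frac{e^{4\alpha(\sqrt{u} - \sqrt{v})}}{\sqrt{u}}dW(u) \right]ds + \int_{v_1}^{v}\frac{2}{s}e^{4\alpha(\sqrt{s} - \sqrt{v})}dW(s). \tag{4.10} \]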
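In this approximation the mark I phase estimate is given by $\hat\varphi_1 = \hat\varphi_1^{(0)} + \alpha\tau\hat\varphi_1^{(1)}$. To first order
in $\tau$ and $\alpha^{-1}$ this has a variance of

$$\left\langle\hat\varphi_1^2\right\rangle \approx \frac{1}{4\alpha} + 2\alpha\tau\left\langle\hat\varphi_1^{(0)}\hat\varphi_1^{(1)}\right\rangle. \tag{4.11}$$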



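Here the known variance of $1/(4\alpha)$ of the zeroth order term has been used. Evaluating the second
term on the right hand side we find

$$\begin{aligned}
\left\langle\hat\varphi_1^{(0)}\hat\varphi_1^{(1)}\right\rangle ={}& e^{-8\alpha(1-\sqrt{v_1})}\left\langle\hat\varphi_{v_1}^{(0)}\hat\varphi_{v_1}^{(1)}\right\rangle + 4\alpha\log(v_1)\,e^{-8\alpha(1-\sqrt{v_1})}\left\langle\left(\hat\varphi_{v_1}^{(0)}\right)^2\right\rangle \\
&- 4\alpha\int_{v_1}^{1}\int_{v_1}^{s}\frac{e^{8\alpha(\sqrt{u}-1)}}{s\,u}\,du\,ds + 2\int_{v_1}^{1}\frac{e^{8\alpha(\sqrt{u}-1)}}{u^{1.5}}\,du.
\end{aligned} \tag{4.12}$$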



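The first two terms decrease exponentially with $\alpha$ and may therefore be omitted. Exchanging the
order of the integrals in the third term and integrating gives

$$\left\langle\hat\varphi_1^{(0)}\hat\varphi_1^{(1)}\right\rangle \approx 4\alpha\int_{v_1}^{1}\frac{\log(u)\,e^{8\alpha(\sqrt{u}-1)}}{u}\,du + 2\int_{v_1}^{1}\frac{e^{8\alpha(\sqrt{u}-1)}}{u^{1.5}}\,du. \tag{4.13}$$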



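To perform these integrals it is convenient to change variables to $s = 1 - \sqrt{u}$, so $du = -2(1-s)\,ds$.
Then we obtain

$$\left\langle\hat\varphi_1^{(0)}\hat\varphi_1^{(1)}\right\rangle \approx 16\alpha\int_{0}^{1-\sqrt{v_1}}\frac{\log(1-s)}{1-s}\,e^{-8\alpha s}\,ds + 4\int_{0}^{1-\sqrt{v_1}}\frac{e^{-8\alpha s}}{(1-s)^2}\,ds. \tag{4.14}$$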



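Expanding in a Maclaurin series in $s$ gives

$$\begin{aligned}
\left\langle\hat\varphi_1^{(0)}\hat\varphi_1^{(1)}\right\rangle &\approx -16\alpha\int_{0}^{1-\sqrt{v_1}}\left(s + \tfrac{3}{2}s^2 + O(s^3)\right)e^{-8\alpha s}\,ds + 4\int_{0}^{1-\sqrt{v_1}}\left(1 + 2s + 3s^2 + O(s^3)\right)e^{-8\alpha s}\,ds \\
&= -16\alpha\left[\frac{1}{(8\alpha)^2} + \frac{3}{(8\alpha)^3} + O(\alpha^{-4})\right] + 4\left[\frac{1}{8\alpha} + \frac{2}{(8\alpha)^2} + \frac{6}{(8\alpha)^3} + O(\alpha^{-4})\right] \\
&= \frac{1}{4\alpha} + O(\alpha^{-2}).
\end{aligned} \tag{4.15}$$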



Note that the upper bound at $1 - \sqrt{v_1}$ has no effect since it gives a term that decays exponentially
with $\alpha$. Using this result, the total phase variance is

$$\left\langle\hat\varphi_1^2\right\rangle \approx \frac{1}{4\alpha} + \frac{\tau}{2}. \tag{4.16}$$



This provides a good verification of the result obtained by the highly simplified method in [35].

Note that this result is based on continuing to use the intermediate phase estimate at the end of
the measurement. If the phase estimate $\arg A$ is used at the end of the measurement, the result will
be different, and cannot be predicted using this approach.
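The asymptotic evaluation leading to Eq. (4.16) can be checked numerically. The sketch below integrates the two integrals obtained after the change of variables $s = 1 - \sqrt{u}$, as written in the code comments, using a composite Simpson rule. The function names, the value $v_1 = 0.01$, and the step count are assumptions chosen for illustration.

```python
import math

def simpson(f, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def first_order_covariance(alpha, v1=0.01):
    """<phi0 * phi1> at v = 1, neglecting the exponentially small
    initial-condition terms:
        16*alpha * int log(1-s)/(1-s) * exp(-8*alpha*s) ds
      +        4 * int exp(-8*alpha*s) / (1-s)**2 ds,
    both taken over 0 <= s <= 1 - sqrt(v1)."""
    upper = 1.0 - math.sqrt(v1)
    log_term = simpson(
        lambda s: math.log(1 - s) / (1 - s) * math.exp(-8 * alpha * s), 0.0, upper)
    pole_term = simpson(
        lambda s: math.exp(-8 * alpha * s) / (1 - s) ** 2, 0.0, upper)
    return 16 * alpha * log_term + 4 * pole_term

def mark1_variance(alpha, tau, v1=0.01):
    """Total variance 1/(4 alpha) + 2 alpha tau <phi0 phi1> to first order in tau."""
    return 1 / (4 * alpha) + 2 * alpha * tau * first_order_covariance(alpha, v1)
```

For large $\alpha$ the covariance approaches $1/(4\alpha)$, so `mark1_variance(alpha, tau)` should approach $1/(4\alpha) + \tau/2$.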

4.2.2 Mark II 

For the mark II case it does not seem to be possible to obtain a consistent result using this approach.
To illustrate this, I will briefly outline the derivation (the details are in Appendix A.3). From [35],
the mark II phase estimate is effectively a time average of the mark I phase estimates:

$$\phi_{\rm II} \approx \int_{0}^{1}\hat\varphi_t\,dt. \tag{4.17}$$
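In order to make this consistent with the above theory, we should take the average only from time
$v_1$, then take the limit of small $v_1$. In perturbation theory the mark II phase estimate is

$$\phi_{\rm II} = \int_{v_1}^{1}\left[\hat\varphi_t^{(0)} + \alpha\tau\hat\varphi_t^{(1)}\right]dt. \tag{4.18}$$

Using this expression the variance is

$$\left\langle\phi_{\rm II}^2\right\rangle = \int_{v_1}^{1}dt\int_{v_1}^{1}dt'\left\langle\hat\varphi_t^{(0)}\hat\varphi_{t'}^{(0)}\right\rangle + 2\alpha\tau\int_{v_1}^{1}dt\int_{v_1}^{1}dt'\left\langle\hat\varphi_t^{(0)}\hat\varphi_{t'}^{(1)}\right\rangle. \tag{4.19}$$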

It is shown in Appendix A.3 that the first term can be simplified to

Vi Vl 

which is similar to that obtained in [35]. For the second term, however, we get
l l 

2ar JdtJ dt> (^W) « i (| (^W) - ^r((^ } ) 2 )) • (4.21) 

Vl Vl 

This is radically different to the result obtained in Ref. [35], and seems to cast some doubt on the
simplified theory used there.

Unlike the result for mark I measurements, this result depends entirely on the conditions at time
$v_1$, which are unknown. In order to obtain a usable result, we would need the initial conditions to
give a negligible contribution, as is the case for mark I measurements. For this reason I will consider
an alternative approach for estimating the increase in the phase variance due to the time delay.



4.3 Theoretical Minimum 

The alternative method of obtaining an estimate for the phase variance with a time delay is to
consider the squeezed state $|\alpha^P,\zeta^P\rangle$ in the probability distribution. As was explained in Sec. jO| ,
the excess phase variance due to the measurement scheme is approximately the phase variance of
this squeezed state.

From Ref. [15], the phase variance of a squeezed state is given by

$$\left\langle\Delta\phi^2\right\rangle \approx \frac{n_0+1}{4\bar n^2} + 2\,{\rm erfc}\left(\sqrt{2n_0}\right), \tag{4.22}$$

where $n_0 = \bar n e^{2\zeta}$ for real $\zeta$. To determine the excess phase variance, we would use the values $\bar n^P$ and
$\zeta^P$ from the squeezed state in the POM, rather than those for the input state. The average value of
$\bar n^P$ will be close to the photon number of the input state.

For states that are significantly less squeezed than optimum, the second term is negligible and
we can omit the term of order $(\bar n^P)^{-2}$. Then this simplifies to

$$\left\langle\Delta\phi^2\right\rangle \approx \frac{e^{2\zeta^P}}{4\bar n^P}. \tag{4.23}$$

Since $\bar n^P$ will be close to the photon number of the input state, it is reasonable to replace it with $\bar n$.

When there is a delay of $\tau$ in the system, before time $\tau$ we have no information about the phase
of the system to use to adjust the local oscillator phase. Therefore we must use a heterodyne scheme
for this time period, rapidly varying the local oscillator phase. This means that $B_\tau$ will be equal to
zero, and no matter how good the phase estimate is after time $\tau$, the largest the magnitude of $B_v$
can be made is $v - \tau$. Then at the end of the measurement, the largest $|B|$ can be is $1 - \tau$, and the
largest $|\zeta^P|$ can be is ${\rm atanh}(1 - \tau)$.

The lower limit to the introduced phase variance when there is a time delay of $\tau$ is therefore

$$\left\langle\Delta\phi^2\right\rangle \approx \frac{e^{-2\,{\rm atanh}(1-\tau)}}{4\bar n}. \tag{4.24}$$



We can expect that the introduced phase variance will be close to this for states of small intrinsic
phase variance, as there will quickly be very good phase estimates available for the feedback. In
addition, the time delay must be sufficiently large that the phase variance given by this expression
is significantly above the introduced phase variance for no time delay.

This result obeys the same scaling law as the result given in [35], but it is a factor of four times
smaller. Note, however, that the limit condition for the result given in [35] is that $\alpha\tau$ is small,
whereas the above result should only be accurate when both $\alpha$ and $\tau$ are reasonably large. The
result here also differs in that it is the limit for the total introduced phase variance, rather than just
the extra phase variance due to the time delay.
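The lower limit of Eq. (4.24) is easy to evaluate directly. The short sketch below (function names are illustrative assumptions) computes $e^{-2\,{\rm atanh}(1-\tau)}/(4\bar n)$ and its small-delay form $\tau/(8\bar n)$, using the identity $e^{-2\,{\rm atanh}(x)} = (1-x)/(1+x)$, so that the exact expression equals $\tau/[(2-\tau)4\bar n]$.

```python
import math

def introduced_variance_limit(tau, nbar):
    """Lower limit exp(-2*atanh(1 - tau)) / (4*nbar), valid for 0 < tau <= 1."""
    return math.exp(-2.0 * math.atanh(1.0 - tau)) / (4.0 * nbar)

def small_delay_limit(tau, nbar):
    """Leading-order form tau / (8*nbar) for tau much smaller than 1."""
    return tau / (8.0 * nbar)

if __name__ == "__main__":
    nbar = 332067.0
    for tau in (2.0 ** -10, 2.0 ** -5, 0.5, 1.0):
        print(tau, introduced_variance_limit(tau, nbar), small_delay_limit(tau, nbar))
```

For a delay equal to the whole measurement time ($\tau = 1$) the expression reduces to $1/(4\bar n)$, while for small $\tau$ the two functions agree to relative order $\tau$.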



4.4 Numerical Results 

These analytic results were also tested numerically. The numerical techniques used were similar to
those described in Sec. 3.3. Minimum uncertainty squeezed states were used, with the stochastic
differential equations for the squeezing parameter a as given in Eq. (3.141) and the value of as
given in Eq. (3.143). For all calculations $2^{20}$ time steps were used, and calculations were performed
with time delays of $2^n$ time steps, where $n$ varies from 0 to 18.

For most of these calculations the same random numbers were used for each time delay in order 
to see the difference in the variance due to the time delay more accurately. If this is not done, the 
differences between the variances for the different time delays are primarily due to the variation of 
the random numbers, rather than the different time delay. Using the same random numbers makes 
the results correlated, so that the differences are primarily due to the time delay. 

For the first $2^n$ time steps the local oscillator phase was rotated by $\pi/2$ each step. For the
following time steps the data up to the time step $2^n$ before the current time step was used. For a
delay of $2^0 = 1$ time steps the data from the previous step was used, corresponding to the technique
for no time delay.

Numerical results for four different phase feedback schemes were obtained: 

(a) The simplified feedback for mark I and II measurements, where

$$d\hat\varphi_v = \frac{I(v)\,dv}{\sqrt{v}}. \tag{4.25}$$

(b) The unsimplified feedback, where the phase estimate is

$$\hat\varphi(v) = \arg A_v. \tag{4.26}$$

(c) The phase estimate that is intermediate between $\arg A_v$ and the best phase estimate

$$\hat\varphi(v) = \arg\left(A_v^{\epsilon}C_v^{1-\epsilon}\right), \tag{4.27}$$

where $\epsilon$ is a constant.

(d) The same as in (c), except that the value of $\epsilon$ varies with time as

Figure 4.1: The extra phase variance (in the final value of the intermediate phase estimate for the 
simplified feedback) due to the time delay plotted as a function of time delay for four different mean 
photon numbers. The data for a mean photon number of 121.590 are shown as crosses, for a photon 
number of 1576.55 as circles, for a photon number of 22254.8 as asterisks and for a photon number 
of 332067 as pluses. The approximate analytic result $\tau/2$ is plotted as the continuous line.



4.4.1 Comparison with Perturbative Theory 

I will firstly consider the case of simplified feedback, and consider the variance in the final value of
the feedback phase, rather than the phase of $A$ or $C$. This case was examined in Sec. 4.2.1, and
the extra phase variance due to the time delay is $\tau/2$ according to that analysis. The extra phase
variance is plotted for four different mean photon numbers in Fig. 4.1. For each of the points shown
$2^{13}$ samples were used.

To determine the extra phase variance due to the time delay, an estimate must be made of the 
phase variance with no time delay. The most convenient estimate to use is the minimum variance 
obtained, as this prevents negative data points that cannot be plotted on a log-log graph. The 
minimum variance is not necessarily that for the smallest time delay, due to the stochastic nature 
of the calculations. 
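The baseline subtraction described above can be sketched as follows. This is an illustrative example only: it assumes the variance of each ensemble is estimated as a Holevo variance, $|\langle e^{i\phi}\rangle|^{-2} - 1$, from a list of sampled phase estimates, and the function names are hypothetical.

```python
import cmath

def holevo_variance(phases):
    """Sample Holevo variance |<exp(i*phi)>|**-2 - 1 of a list of phases."""
    mean = sum(cmath.exp(1j * p) for p in phases) / len(phases)
    return 1.0 / abs(mean) ** 2 - 1.0

def extra_variances(variances_by_delay):
    """Subtract the minimum variance obtained, used as the estimate of the
    variance with no time delay, so that no entry is negative."""
    baseline = min(variances_by_delay)
    return [v - baseline for v in variances_by_delay]
```

Because the baseline is the minimum over all delays, one entry is exactly zero and the rest are non-negative, which is what makes a log-log plot of the remaining points possible.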



The theoretical asymptotic value of $\tau/2$ is also plotted in Fig. 4.1. As can be seen, many of
the results are close to the theoretical line for the intermediate time delays. For small time delays,
the extra phase variance due to the time delay is too small a fraction of the total phase variance
for the results to be accurate. The reason why the results deviate from the asymptotic result for
large time delays is that this result is for the limit of small $\alpha\tau$. Note also that the results for larger
photon numbers deviate from the asymptotic result for smaller $\tau$ than the results for smaller photon
numbers. This is what can be expected from this limit condition.

It is also possible to use fitting techniques to determine how closely the numerical results agree
with the theoretical asymptotic result of $\tau/2$. The data and fitted lines for mean photon numbers
of 121.590, 1576.55, 22254.8, 332067 and 5122478 are plotted in Figs. 4.2, 4.3, 4.4, 4.5 and 4.6
respectively. For each of the points shown $2^{14}$ samples were used. These results were determined
using independent random numbers for each data point. Calculations were also performed using the
same random numbers, in order to reduce the relative error between the data points. (These are the
data points plotted in Fig. 4.1.) Unfortunately this method tends to produce systematic error in
the slope, which is not reflected in the uncertainty. It is therefore better to use independent random
numbers but a large number of samples to estimate the slope.

Figure 4.2: The Holevo variance in the final value of the intermediate phase estimate for a mean
photon number of 121.590. The results are shown as the crosses and the continuous line is that
fitted to the data.

Some of the initial data points have been omitted in each graph, as these were too close to each
other to be useful. The data points for larger time delays have also been omitted. These tend to be
less accurate, as the approximation is in the limit of small $\alpha\tau$. More specifically, for most of these
graphs the data points for $\alpha\tau > 0.08$ were omitted. The slopes of the fitted lines found were

$S_{4.2} = 0.629 \pm 0.019$,
$S_{4.3} = 0.490 \pm 0.032$,
$S_{4.4} = 0.590 \pm 0.098$,
$S_{4.5} = 0.525 \pm 0.039$,
$S_{4.6} = 0.498 \pm 0.075$.



Except for the first result, these results are all consistent with the theoretical value of 0.5. The third
result is higher than 0.5, but also has a large uncertainty. Note that, due to the linearisation in
Eq. (4.3), we can only expect the $\tau/2$ result to be accurate in the limit of large $\alpha$. The reason for
the larger slope for the smallest photon number is likely to be that $\alpha$ is not sufficiently large for the
linearisation to be accurate.
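A slope and its uncertainty of the kind quoted above can be obtained from a straight-line fit of variance against time delay. The sketch below uses an ordinary unweighted least-squares fit; whether the fits in this chapter were weighted is not stated, so this choice, like the function name, is an assumption.

```python
import math

def fit_line(x, y):
    """Unweighted least-squares fit y = intercept + slope * x.

    Returns (slope, intercept, slope_std_err), where the standard error is
    estimated from the residuals with n - 2 degrees of freedom.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    residual = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    slope_err = math.sqrt(residual / (n - 2) / sxx)
    return slope, intercept, slope_err
```

Applied to variances that follow $1/(4\alpha) + \tau/2$ exactly, the fit returns a slope of 0.5 and an intercept of $1/(4\alpha)$. With noisy data the returned standard error plays the role of the uncertainties quoted with the slopes.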

As was mentioned above, the result for the additional phase variance due to the time delay is
only valid for the variance in the final value of the phase estimate, which is not the same as $\arg A$
when there is a time delay. In Fig. 4.7 I have plotted the variation of the phase variance with time
delay for three alternative final phase estimates, $\hat\varphi_1$, $\arg A$ and $\arg C$. This is for a photon number
of approximately 332000, and is fairly representative of the results for other photon numbers. For
these results, and the rest of the results in this chapter, $2^{11}$ samples were used for each data point.

As can be seen, for very small time delays the variances in the $\hat\varphi_1$ and $\arg A$ phase estimates
are almost identical. As the time delay is increased, however, the variance of $\hat\varphi_1$ increases, but the
variance of $\arg A$ decreases. This is because, as the intermediate phase estimate gets worse, the value
of $|B|$ decreases. This means that $A$ is closer to $C$, so $\arg A$ is closer to the best phase estimate.
Note, however, that the variance of $\arg A$ rises again, and does not converge to $\arg C$ for large time
delays. This is because $|B|$ does not fall to zero.
Figure 4.3: The Holevo variance in the final value of the intermediate phase estimate for a mean
photon number of 1576.55. The results are shown as the crosses and the continuous line is that 
fitted to the data. 
Figure 4.4: The Holevo variance in the final value of the intermediate phase estimate for a mean
photon number of 22254.8. The results are shown as the crosses and the continuous line is that 
fitted to the data. 
Figure 4.5: The Holevo variance in the final value of the intermediate phase estimate for a mean
photon number of 332067. The results are shown as the crosses and the continuous line is that fitted 
to the data. 




Figure 4.6: The Holevo variance in the final value of the intermediate phase estimate for a mean 
photon number of 5122478. The results are shown as the crosses and the continuous line is that 
fitted to the data. 
Figure 4.7: The variance of three alternative final phase estimates for simplified feedback with a time
delay plotted as a function of time delay. The results for the final value of the intermediate phase
estimate are plotted as a continuous line, for $\arg A$ as a dotted line, and for $\arg C$ as a dash-dotted
line. All results are for a photon number of 332067.



4.4.2 Comparison with Theoretical Minimum 

Lastly the variance in the phase of $C$ will be considered. As was explained above, the theoretical
lower limit to the introduced phase variance is $\tau/(8\bar n)$. I have plotted the introduced variance in
the best phase estimate $\arg C$ and the theoretical limit in Fig. 4.8. For additional accuracy I have
plotted

$$\frac{e^{-2\,{\rm atanh}(1-\tau)}}{4\bar n}, \tag{4.29}$$

as this will continue to be accurate for time delays that are a large fraction of 1. This plot is for a
photon number of 332000, and similar results are obtained for other photon numbers. In the case of
simplified feedback, the phase variance is well above the theoretical limit. For large time delays the
phase variance approximately converges to the heterodyne phase variance, also shown in Fig. 4.8.



The introduced phase variance for mark II measurements with the unsimplified $\arg A_v$ feedback
is also shown in Fig. 4.8. The introduced phase variance for this case increases far more slowly with
the time delay, and for larger time delays it is very close to the theoretical limit. These results
indicate that if there is any significant time delay in the system, the simplified feedback will give a
far worse result than using $\arg A_v$.

It is possible to make a correction to the simplified phase feedback scheme that improves this
result somewhat. Many different alternatives were tried, and the one that gave the best results was

$$d\hat\varphi_v = \frac{I(v)\,dv}{\sqrt{v} + \alpha\tau}. \tag{4.30}$$

This correction is based on the fact that $|A_v|$ is larger than $\sqrt{v}$ when the phase estimate is worse
than $\arg A_v$. (From |3q ), the factor of $\sqrt{v}$ in the simplified feedback comes from a factor of $|A_v|$.)
The results for this correction are also shown in Fig. 4.8. The phase variances obtained in this case
are significantly below those for the plain simplified feedback, but are still far above the results for
the unsimplified $\arg A_v$ feedback.
Figure 4.8: The introduced phase variance for three different phase feedback schemes plotted as a
function of time delay. The dotted line is for simplified feedback, the dash-dotted line is for the
corrected simplified feedback, and the circles are for unsimplified $\arg A_v$ feedback. The best phase
estimate $\arg C$ is used in all three cases. The continuous horizontal line is the phase variance for
heterodyne measurements, and the continuous diagonal line is the theoretical limit. All results are
for a photon number of 332067.
Figure 4.9: The introduced phase variance for better intermediate phase estimates plotted as a
function of time delay. The pluses are for the constant $\epsilon$ case and the crosses are for the time
dependent $\epsilon$ case. The theoretical limit estimated using the mean inverse photon numbers obtained
from the time dependent $\epsilon$ case is plotted as the dotted line, and the theoretical limit using the input
photon number is shown as the continuous line. All results are for a photon number of 332067.



Now we will consider the results for better intermediate phase estimates that are between $\arg A_v$
and $\arg C_v$. The introduced phase variance for the constant $\epsilon$ case and the theoretical limit are
shown in Fig. 4.9. These results are again for a photon number of about 332000. The results for
this case are even closer to the theoretical limit than those for the mark II case.

The introduced phase variance for the feedback with time-dependent $\epsilon$ is also plotted in Fig. 4.9.
The results for this case converge to the theoretical limit at smaller time delays than for the constant
$\epsilon$ case. For the larger time delays the results for the two cases are about the same, slightly above
the theoretical limit.

In both cases the phase variance is still noticeably above the theoretical limit for large time
delays, and the values of $B$ obtained are too close to $1 - \tau$ to account for this difference. The
difference appears to be due to the approximation that the photon number of the state $|\alpha^P,\zeta^P\rangle$ is
close to the photon number of the input state. The average value of this photon number is close to
the photon number of the input state; however, each individual value is not necessarily close to $\bar n$.
The expression for the introduced phase variance depends on the inverse of the photon number, and
the average of an inverse is not necessarily equal to the inverse of an average. The general expression
is

$$\left\langle\frac{1}{n}\right\rangle = \frac{1}{\langle n\rangle} + \frac{\left\langle\Delta n^2\right\rangle}{\langle n\rangle^3} + O\left(\langle n\rangle^{-4}\right). \tag{4.31}$$
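A small numerical example makes the size of this correction concrete. The sketch below (with hypothetical function names and an arbitrary three-point distribution) compares the exact mean of $1/n$ with the expansion of Eq. (4.31) truncated after the variance term, and with the naive value $1/\langle n\rangle$.

```python
def mean_inverse_exact(values):
    """Exact average of 1/n over equally weighted samples."""
    return sum(1.0 / n for n in values) / len(values)

def mean_inverse_second_order(values):
    """Expansion 1/<n> + <dn^2>/<n>^3, truncated after the variance term."""
    mean = sum(values) / len(values)
    variance = sum((n - mean) ** 2 for n in values) / len(values)
    return 1.0 / mean + variance / mean ** 3

if __name__ == "__main__":
    samples = [90.0, 100.0, 110.0]
    print(mean_inverse_exact(samples))         # exact average of the inverse
    print(mean_inverse_second_order(samples))  # truncated expansion
    print(1.0 / (sum(samples) / len(samples))) # inverse of the average
```

For these samples the truncated expansion is roughly a hundred times closer to the exact value than the inverse of the average is.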



In Fig. 4.9 I have also plotted the estimated theoretical limit based on the average of $1/\bar n^P$ for the
data obtained in the time dependent $\epsilon$ case. Specifically, the expression plotted is

$$\left\langle\frac{1}{\bar n^P}\right\rangle\frac{e^{-2\,{\rm atanh}(1-\tau)}}{4}. \tag{4.32}$$

As can be seen, the introduced phase variance converges to this far more closely than to the limit
based on the photon number of the input state.
Figure 4.10: The introduced phase variance for three different feedback schemes plotted as a function
of time delay. The circles are for mark II measurements, the pluses are for the constant $\epsilon$ case, and
the crosses are for the time dependent $\epsilon$ case. The theoretical limit estimated using the input photon
number is shown as the continuous line. All results are for a photon number of 5122478.



For larger photon numbers this factor is not so significant, and the difference between the results
and the theoretical limit is smaller. For example, the results for the three different feedback schemes
(mark II, constant $\epsilon$ and time dependent $\epsilon$) for a photon number of about $5\times10^{6}$ are plotted in
Fig. 4.10.



Chapter 5 



Optimum Input States for 
Interferometry 

5.1 Introduction 

In chapters || to || just a single mode of the electromagnetic field was considered. The reference 
phase was provided by a local oscillator field, which was assumed to be sufficiently large amplitude 
that it could be treated classically. The main alternative to this is to consider two modes, both of 
which are treated quantum mechanically. Rather than measuring the absolute phase, we now wish 
to measure the phase difference between the two modes. The simplest way of considering this is via 
the Mach-Zehnder interferometer, as in Fig. 5.1.

With the Mach-Zehnder interferometer, an initial two-mode input state is fed into a beam splitter.
The two beams are then subjected to phase shifts of $\Phi$ and $\varphi$, then recombined at a second beam
splitter. The outputs of this beam splitter are then detected using photodetectors. The counts at
the photodetectors are used to obtain an estimate of the phase difference.

These photodetectors will be assumed to be ideal for this analysis, though current photodetectors 
can only achieve a sensitivity of around 87% |Q . The photodetectors required for this need to be able 
to distinguish individual photons, in contrast to the large intensity photodetectors required for dyne 
detection. This is a more difficult task, and the efficiencies currently possible are correspondingly 
lower. In this study, however, we are concerned with what is possible in principle using foreseeable 
technologies. I will therefore be considering only the case of unit efficiency. 




Figure 5.1: The Mach-Zehnder interferometer, with the addition of a controllable phase $\Phi$ in one
arm. The unknown phase to be estimated is $\varphi$. Both beam splitters (BS) are 50/50.

In general the interferometer is used to measure the phase difference between the two arms.
Similarly to the single mode case, we wish to add an auxiliary phase shift $\Phi$ in order to obtain a
more accurate phase measurement. For simplicity I have indicated the phase we wish to measure,
$\varphi$, as the phase shift in one arm in Fig. 5.1. In the following analysis this phase can equivalently be
taken to be the phase difference between the two arms in the absence of the auxiliary phase shift $\Phi$.

It is well known that it is possible to obtain a phase uncertainty scaling as $1/\sqrt{N}$ (the standard
quantum limit) when a photon number state with $N$ photons is input to one port of the
interferometer. Several authors [1], [39], [40], [41], [42] have proposed ways of reducing the phase uncertainty to
the Heisenberg limit of $1/N$. Most of these proposals [1], [39], [40] are essentially for detecting small
deviations from some known phase, and therefore only give a $1/N$ scaling for a very small range of
phases. Sanders and Milburn [41], [42] considered ideal measurements that give $1/N$ scaling
independent of the phase. Unfortunately they do not discuss how these measurements can be achieved in
practice, and as will be shown in Ch. |^ it is not possible to perform these measurements in general,
even allowing feedback.

We wish to perform measurements as close as possible to ideal by varying $\Phi$ during the
measurement, based on the detections. In general there are three different areas available for optimisation:

1. The initial input state. 

2. How the feedback phase is changed during the measurement. 

3. The final phase estimate. 

In this chapter I discuss the optimum initial input states, and in the next chapter the feedback phase 
and final phase estimate are discussed. 



5.2 Optimum Input States 

The input states are most conveniently described using the Schwinger representation. The operators 
in this representation are 

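$$\begin{aligned}
J_x &= (a^\dagger b + a b^\dagger)/2, \\
J_y &= (a^\dagger b - a b^\dagger)/2i, \\
J_z &= (a^\dagger a - b^\dagger b)/2, \\
J^2 &= J_x^2 + J_y^2 + J_z^2.
\end{aligned} \tag{5.1}$$

The operators $J_x$, $J_y$ and $J_z$ satisfy the commutation relations for the Lie algebra of SU(2):

$$[J_x, J_y] = iJ_z, \qquad [J_y, J_z] = iJ_x, \qquad [J_z, J_x] = iJ_y. \tag{5.2}$$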



The operator $J^2$ is the Casimir invariant for this group (i.e. it commutes with all elements). These
commutation relations are the same as those for the operators for components of angular momentum.
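The commutation relations and the Casimir property can be verified numerically in the $(2j+1)$-dimensional subspace of fixed total photon number $N = 2j$. The sketch below builds the standard spin-$j$ matrices from the raising operator rather than from the mode operators $a$ and $b$; this construction, and all function names, are assumptions made for the example.

```python
import math

def spin_matrices(two_j):
    """Return (Jx, Jy, Jz) as nested lists for spin j = two_j / 2,
    in the basis mu = j, j-1, ..., -j."""
    dim = two_j + 1
    spin = two_j / 2
    mus = [spin - k for k in range(dim)]
    jz = [[0j] * dim for _ in range(dim)]
    jp = [[0j] * dim for _ in range(dim)]
    for k, mu in enumerate(mus):
        jz[k][k] = complex(mu)
        if k > 0:
            # J+ |j, mu> = sqrt(j(j+1) - mu(mu+1)) |j, mu+1>
            jp[k - 1][k] = complex(math.sqrt(spin * (spin + 1) - mu * (mu + 1)))
    jm = [[jp[c][r].conjugate() for c in range(dim)] for r in range(dim)]
    jx = [[(jp[r][c] + jm[r][c]) / 2 for c in range(dim)] for r in range(dim)]
    jy = [[(jp[r][c] - jm[r][c]) / 2j for c in range(dim)] for r in range(dim)]
    return jx, jy, jz

def matmul(a, b):
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[r][c] - ba[r][c] for c in range(len(a))] for r in range(len(a))]

def casimir(jx, jy, jz):
    """J^2 = Jx^2 + Jy^2 + Jz^2."""
    sq = [matmul(m, m) for m in (jx, jy, jz)]
    n = len(jx)
    return [[sq[0][r][c] + sq[1][r][c] + sq[2][r][c] for c in range(n)]
            for r in range(n)]

def max_abs_diff(a, b):
    return max(abs(a[r][c] - b[r][c])
               for r in range(len(a)) for c in range(len(a)))
```

For any `two_j`, `commutator(jx, jy)` should equal $iJ_z$ (and cyclic permutations), and `casimir` should return $j(j+1)$ times the identity.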

I will use the notation $|j\mu\rangle_z$ for the common eigenstate of $J_z$ and $J^2$, with eigenvalues of $\mu$ and
$j(j+1)$ respectively. This corresponds to Fock number eigenstates at the interferometer inputs with
photon numbers in ports $a$ and $b$ of $j+\mu$ and $j-\mu$ respectively. Similarly the notation $|j\mu\rangle_y$ means
the common eigenstate of $J_y$ and $J^2$ with eigenvalues of $\mu$ and $j(j+1)$. This state corresponds to
number eigenstates in the interferometer arms with photon numbers of $j+\mu$ and $j-\mu$. To see this,
note that the annihilation operators for the modes in the two interferometer arms are

$$(a + ib)/\sqrt{2}, \qquad (ia + b)/\sqrt{2}. \tag{5.3}$$

It is simple to show from this that the operator for the difference in the photon numbers between the
arms is

$$\tfrac{1}{2}(-ia^\dagger + b^\dagger)(ia + b) - \tfrac{1}{2}(a^\dagger - ib^\dagger)(a + ib) = (a^\dagger b - ab^\dagger)/i, \tag{5.4}$$

which is twice $J_y$.

Note that the scattering matrix used here for the beam splitter is

$$\frac{1}{\sqrt{2}}\begin{pmatrix} 1 & i \\ i & 1 \end{pmatrix}. \tag{5.5}$$

In contrast, the scattering matrix considered in the case of dyne measurements was

$$\frac{1}{\sqrt{2}}\begin{pmatrix} 1 & 1 \\ -1 & 1 \end{pmatrix}. \tag{5.6}$$

These scattering matrices have been used for consistency with previously published work. The only
important consequence of this difference is that for dyne measurements, small errors are obtained
when the difference between the signal and local oscillator phases is $\pi/2$, whereas in the
interferometer case the phase difference should be close to zero. This is because, in the interferometer case,
the $i$ in the scattering matrix gives a $\pi/2$ phase shift.

In order to represent a completely general state we can express it as a sum of input number
states:

$$|\psi\rangle = \sum_{2j=0}^{\infty}\sum_{\mu=-j}^{j}\psi_{j\mu}|j\mu\rangle_z. \tag{5.7}$$

Most proposals for reducing the phase uncertainty to the $1/N$ limit consider only input states with
a fixed total photon number of $N = 2j$. This restriction is also applied in this study, as it greatly
simplifies the analysis. With this restriction, the input state can be represented as

$$|\psi\rangle = \sum_{\mu=-j}^{j}\psi_{\mu}|j\mu\rangle_z. \tag{5.8}$$

Similarly to the single mode case, the probability distribution for the estimate of the phase of a
two mode input state, $\phi$, is in general given by

$$P(\phi) = {\rm Tr}\left[\rho F(\phi)\right], \tag{5.9}$$

where $F(\phi)$ is the POM for the measurement. This only depends on the intrinsic phase of the two
mode state, and not the phase shift in the interferometer. The interferometer transforms the input
state to $I(\varphi)|\psi\rangle$, where $\varphi$ is the phase in the interferometer and

$$I(\varphi) = \exp\left(-i\varphi J_y\right). \tag{5.10}$$

The state matrix therefore transforms to

$$\rho' = I(\varphi)\,\rho\,I^\dagger(\varphi). \tag{5.11}$$

When this transformed state is used in Eq. (5.9) the probability distribution will be dependent
on the interferometer phase. This transformation acts to shift the phase of the state by $\varphi$, so the
probability distribution will depend on the sum of the phase of the input state and the interferometer
phase shift.

We can alternatively include the interferometer in the POM, and use the input state in Eq. (5.9).
Then we would transform the POM to

$$F'(\phi) = I^\dagger(\varphi)F(\phi)I(\varphi). \tag{5.12}$$

If the POM describes a shift invariant phase measurement, then this can be simplified to 

= F(cj>), (5.13) 
where $\phi$ is the error in the phase estimate. Alternatively $\phi$ can be taken to be the phase estimate
when the interferometer phase shift is zero.

Sanders and Milburn [41], [42] considered what they call "optimal" measurements, where the POM
is given by

$$F(\phi) = \frac{2j+1}{2\pi}|j\phi\rangle\langle j\phi|, \tag{5.14}$$

where the $|j\phi\rangle$ are normalised phase states given by

$$|j\phi\rangle = (2j+1)^{-1/2}\sum_{\mu=-j}^{j}e^{i\mu\phi}|j\mu\rangle_y. \tag{5.15}$$

As I will show below, this POM is equivalent to the ideal or canonical POM for a single mode. For
this reason I will generally use the same terminology for this POM, and reserve the word "optimal"
for the best possible measurements that are realisable using photodetection and feedback.
When expressed in terms of the eigenstates of $J_y$, the canonical POM is

$$F(\phi) = \frac{1}{2\pi}\sum_{\mu,\nu=-j}^{j}e^{i(\mu-\nu)\phi}|j\mu\rangle_y\langle j\nu|. \tag{5.16}$$
This is very similar to the POM in the single mode case when there is an upper limit of $N = 2j$ on
the photon number. This POM is not given explicitly above; however, it can be obtained by taking
the single mode canonical POM given by Eq. (1.36), and limiting the sum to $N$. Explicitly, this
POM is

$$F^{\rm can}(\phi) = \frac{1}{2\pi}\sum_{n,m=0}^{N}e^{i(n-m)\phi}|n\rangle\langle m|. \tag{5.17}$$

To show that the interferometer POM of Eq. (5.14) is completely equivalent to this, we can make
the change of variables

$$\mu' = \mu + j, \qquad \nu' = \nu + j, \qquad |\mu'\rangle = |j\mu\rangle_y. \tag{5.18}$$

With this change in notation, the POM is

$$F(\phi) = \frac{1}{2\pi}\sum_{\mu',\nu'=0}^{N}e^{i(\mu'-\nu')\phi}|\mu'\rangle\langle\nu'|. \tag{5.19}$$



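This POM is identical to the POM for the single mode case for an upper limit on the photon number
of $N$. This case was considered in Sec. 2.1.1, and the state that minimises the Holevo phase variance
is

$$|\psi_{\rm opt}\rangle = \frac{1}{\sqrt{j+1}}\sum_{\mu'=0}^{2j}\sin\left[\frac{(\mu'+1)\pi}{2j+2}\right]|\mu'\rangle. \tag{5.20}$$

Converting the notation back to the original variables, this state is

$$|\psi_{\rm opt}\rangle = \frac{1}{\sqrt{j+1}}\sum_{\mu=-j}^{j}\sin\left[\frac{(\mu+j+1)\pi}{2j+2}\right]|j\mu\rangle_y. \tag{5.21}$$

The minimum Holevo phase variance corresponding to this state is

$$V(\phi) = \tan^2\left(\frac{\pi}{2j+2}\right). \tag{5.22}$$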
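The relation between the sine-shaped optimum state and the variance $\tan^2[\pi/(N+2)]$ with $N = 2j$ can be checked directly. The sketch below assumes real coefficients and the canonical POM, for which $\langle e^{i\phi}\rangle$ is the sum of products of neighbouring coefficients; the function names are illustrative.

```python
import math

def optimal_coefficients(n_photons):
    """Coefficients sqrt(2/(N+2)) * sin((k+1)*pi/(N+2)) for k = 0..N."""
    norm = math.sqrt(2.0 / (n_photons + 2))
    return [norm * math.sin((k + 1) * math.pi / (n_photons + 2))
            for k in range(n_photons + 1)]

def canonical_holevo_variance(coeffs):
    """Holevo variance |<exp(i*phi)>|**-2 - 1 under canonical measurements,
    using <exp(i*phi)> = sum_k c_k * c_{k+1} for real coefficients."""
    sharpness = sum(coeffs[k] * coeffs[k + 1] for k in range(len(coeffs) - 1))
    return 1.0 / sharpness ** 2 - 1.0

if __name__ == "__main__":
    for n in (2, 40, 1200):
        c = optimal_coefficients(n)
        print(n, canonical_holevo_variance(c), math.tan(math.pi / (n + 2)) ** 2)
```

The two printed columns should agree to rounding error, and for large $N$ both approach $\pi^2/N^2$.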
The states $|j\mu\rangle_y$ correspond to joint number states within the interferometer, and they do not
correspond to input number states in a simple way. We therefore wish to re-express the optimum
states in terms of the input number states $|j\mu\rangle_z$. To do this, we require the relation from Ref. H],

$${}_z\langle j\mu|j\nu\rangle_y = e^{i(\pi/2)(\nu-\mu)}I^j_{\mu\nu}(\pi/2), \tag{5.23}$$

where $I^j_{\mu\nu}(\pi/2)$ are the interferometer matrix elements given by

$$I^j_{\mu\nu}(\pi/2) = 2^{-\mu}\left[\frac{(j-\mu)!\,(j+\mu)!}{(j-\nu)!\,(j+\nu)!}\right]^{1/2}P^{(\mu-\nu,\mu+\nu)}_{j-\mu}(0), \tag{5.24}$$

for

$$\mu - \nu \ge 0, \qquad \mu + \nu \ge 0, \tag{5.25}$$

where $P^{(\alpha,\beta)}_n(x)$ are the Jacobi polynomials, given by

$$P^{(\alpha,\beta)}_n(x) = 2^{-n}\sum_{m=0}^{n}\binom{n+\alpha}{m}\binom{n+\beta}{n-m}(x-1)^{n-m}(x+1)^{m}. \tag{5.26}$$

Therefore the explicit expression for determining the values of $I^j_{\mu\nu}(\pi/2)$ is

$$\begin{aligned}
I^j_{\mu\nu}(\pi/2) ={}& 2^{-j}\left[(j-\mu)!\,(j+\mu)!\,(j-\nu)!\,(j+\nu)!\right]^{1/2} \\
&\times\sum_{m=0}^{j-\mu}\left[m!\,(j-\nu-m)!\,(j-\mu-m)!\,(\mu+\nu+m)!\right]^{-1}(-1)^{j-\mu-m}.
\end{aligned} \tag{5.27}$$
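The explicit sum of Eq. (5.27) is straightforward to evaluate for small photon numbers. The sketch below is illustrative only. It indexes the states by the integers $a = j+\mu$ and $b = j+\nu$ so that half-integer $j$ needs no special handling. As an assumption that goes beyond the inequalities (5.25), it restricts the sum over $m$ to the terms in which every factorial argument is non-negative, which is the usual form of the Wigner rotation-matrix sum and lets the same function be used for all $\mu$ and $\nu$.

```python
from math import factorial, sqrt

def interferometer_element(n_total, a, b):
    """I^j_{mu nu}(pi/2) for j = n_total/2, mu = a - j, nu = b - j.

    Intended for small n_total; the factorials are converted to floats.
    """
    prefactor = 2.0 ** (-n_total / 2) * sqrt(
        factorial(n_total - a) * factorial(a)
        * factorial(n_total - b) * factorial(b))
    total = 0.0
    m_low = max(0, n_total - a - b)            # keeps (mu + nu + m)! defined
    m_high = min(n_total - a, n_total - b)     # keeps (j - mu - m)!, (j - nu - m)! defined
    for m in range(m_low, m_high + 1):
        sign = -1.0 if (n_total - a - m) % 2 else 1.0   # (-1)**(j - mu - m)
        total += sign / (factorial(m) * factorial(n_total - b - m)
                         * factorial(n_total - a - m)
                         * factorial(a + b - n_total + m))
    return prefactor * total

def interferometer_matrix(n_total):
    """Full (n_total + 1) x (n_total + 1) matrix of elements."""
    size = n_total + 1
    return [[interferometer_element(n_total, a, b) for b in range(size)]
            for a in range(size)]
```

A useful consistency check is that the rows of `interferometer_matrix(n_total)` are orthonormal, as they must be for a matrix describing a change of basis.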



The values of $I^j_{\mu\nu}(\pi/2)$ for values of $\mu$ and $\nu$ that do not obey the inequalities (5.25) can be obtained
using the symmetry relations

(5.28)

Using the result from Eq. (5.23), the state expressed in terms of the eigenstates of $J_z$ is

$$|\psi_{\rm opt}\rangle = \frac{1}{\sqrt{j+1}}\sum_{\mu,\nu=-j}^{j}\sin\left[\frac{(\mu+j+1)\pi}{2j+2}\right]e^{i(\pi/2)(\mu-\nu)}I^j_{\nu\mu}(\pi/2)|j\nu\rangle_z. \tag{5.29}$$



An example of an optimum state for 40 photons calculated using this formula is shown in Fig. 5.2.
As can be seen, the only significant contributions are from 9 or 10 $J_z$ eigenstates near $\mu = 0$. In
addition, the distribution near the centre is fairly independent of photon number, as can be seen by
comparing this state with the state for 1200 photons, also shown in Fig. 5.2. For larger values of $\mu$,
the contributions fall approximately exponentially with $\mu$, as can be seen in Fig. 5.3.

In Ref. [39] it is shown that it is possible to obtain states similar to $|j0\rangle_z$ and $|j1\rangle_z$ using a
two-mode four-wave mixer. As the optimum input states have their main contributions from these states
and other $J_z$ eigenstates near $\mu = 0$, this suggests that it may be possible to produce states that are
close approximations of the optimum input states using a suitable modification of the apparatus used
in Ref. [39]. Unfortunately, rather than producing only the state $|j0\rangle_z$ (for example), the four-wave
mixer produces a superposition of these states with a range of values of $j$.

In [39] the authors state that the value of $j$ can be inferred after the measurement from the
number of photons detected. This would not be appropriate for the measurements considered in the
next chapter, as these rely on knowing the value of $j$ before the measurement starts. In the limit of
large photon number the spread in the values of $j$ will be small compared to the mean, so it may be
possible to obtain good measurements using the mean value of $j$. Unfortunately, the measurement
scheme considered in the next chapter is computationally infeasible in the limit of large $j$.
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Figure 5.2: The coefficients z {j lA^Popt) for the state optimised for minimum phase variance under 
canonical measurements. All coefficients for a photon number of 2j = 40 are shown as the continuous 
line, and those near \i = for a photon number of 1200 as crosses. 
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Figure 5.3: The coefficients z (j^\ipopt) for larger values of fj, for the state optimised for minimum 
phase variance under canonical measurements for a photon number of 1200. 
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Figure 5.4: The minimum number of J z eigenstates required to approximate the optimum state in 
order to obtain a phase variance less than twice optimum. 



Nevertheless, it should be simpler to produce a small number of $J_z$ eigenstates than the entire range. In order to estimate about how many $J_z$ eigenstates are required to provide a reasonable approximation of the optimum states for a given photon number, the coefficients for the optimum states were determined, and all except a number of coefficients near $\mu = 0$ were discarded. These remaining coefficients were then normalised, and the phase variance of the resulting state was determined.

In Fig. 5.4 I have plotted the number of $J_z$ eigenstates required to approximate the optimum states for a variety of total photon numbers. The criterion used was that the phase variance of the approximate state be less than twice that of the exact state. For photon numbers up to about 400, only 9 $J_z$ eigenstates are required, but beyond that the number required increases fairly rapidly, with 17 required for 1600 photons.
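The truncation procedure described above can be sketched numerically as follows. This is an illustrative reconstruction under stated assumptions, not the original program: $j$ is taken to be an integer, the overlap ${}_y\langle j\mu|j\nu\rangle_z = e^{i(\pi/2)(\nu-\mu)}I^j_{\mu\nu}(\pi/2)$ is inferred from Eq. (5.29), the Holevo variance is taken as $|\langle e^{i\phi}\rangle|^{-2}-1$ for canonical measurements, and the names `rotation_element` and `truncated_variance` are hypothetical.

```python
import cmath
import math
from fractions import Fraction


def rotation_element(j, mu, nu):
    """I^j_{mu nu}(pi/2) for integer j: explicit sum plus symmetry relations."""
    sign = 1
    if mu < abs(nu):                 # map into the region mu >= |nu|
        if nu >= abs(mu):
            sign = -1 if (mu - nu) % 2 else 1
            mu, nu = nu, mu
        elif -nu >= abs(mu):
            mu, nu = -nu, -mu
        else:
            sign = -1 if (mu - nu) % 2 else 1
            mu, nu = -mu, -nu
    total = Fraction(0)
    for m in range(j - mu + 1):
        denom = (math.factorial(m) * math.factorial(j - nu - m)
                 * math.factorial(j - mu - m) * math.factorial(mu + nu + m))
        total += Fraction((-1) ** (j - mu - m), denom)
    norm = (math.factorial(j - mu) * math.factorial(j + mu)
            * math.factorial(j - nu) * math.factorial(j + nu))
    return sign * math.sqrt(norm) * float(total) / 2 ** j


def truncated_variance(j, keep):
    """Holevo variance after keeping only the J_z components with |nu| <= keep."""
    idx = range(-j, j + 1)
    # u[mu][nu]: assumed overlap between J_y eigenstate mu and J_z eigenstate nu
    u = {mu: {nu: cmath.exp(1j * (math.pi / 2) * (nu - mu))
              * rotation_element(j, mu, nu) for nu in idx} for mu in idx}
    # optimum-state amplitudes in the J_y basis
    a = {mu: math.sin((mu + j + 1) * math.pi / (2 * j + 2)) / math.sqrt(j + 1)
         for mu in idx}
    # transform to the J_z basis, discard components away from nu = 0
    c = {nu: (sum(u[mu][nu].conjugate() * a[mu] for mu in idx)
              if abs(nu) <= keep else 0j) for nu in idx}
    norm = math.sqrt(sum(abs(v) ** 2 for v in c.values()))
    # renormalise and transform back to the J_y basis
    b = {mu: sum(u[mu][nu] * c[nu] for nu in idx) / norm for mu in idx}
    sharp = abs(sum(b[mu] * b[mu - 1].conjugate() for mu in range(-j + 1, j + 1)))
    return math.inf if sharp == 0 else sharp ** -2 - 1


j = 10
exact = truncated_variance(j, j)          # no truncation
for keep in range(1, 6):
    print(2 * keep + 1, truncated_variance(j, keep) / exact)
```

The printed ratio is the quantity that is compared with the factor of two in the criterion above; with no truncation the variance should reduce to $\tan^2[\pi/(2j+2)]$.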



5.3 Phase Variances for Other States 

I will now consider the phase variances of some other commonly considered states in order to compare 
them with the optimum states. These states will be expressed in terms of the input photon number 
eigenstates: 

$$
|\psi\rangle = \sum_{\mu=-j}^{j}\psi_\mu\,|j\mu\rangle_z. \tag{5.30}
$$

The probability distribution for the phase for these states will be given by

$$
P(\phi) = \mathrm{Tr}\left[|\psi\rangle\langle\psi|F(\phi)\right]
 = \frac{2j+1}{2\pi}\,\mathrm{Tr}\left[|\psi\rangle\langle\psi|j\phi\rangle\langle j\phi|\right]
 = \frac{2j+1}{2\pi}\left|\langle j\phi|\psi\rangle\right|^2. \tag{5.31}
$$

The inner product, when expressed in terms of the coefficients $\psi_\mu$, is given by

$$
\langle j\phi|\psi\rangle = \sum_{\nu=-j}^{j}\psi_\nu\,\langle j\phi|j\nu\rangle_z
 = (2j+1)^{-1/2}\sum_{\mu,\nu=-j}^{j}\psi_\nu\, e^{-i\mu\phi}\, e^{i(\pi/2)(\nu-\mu)}\, I^j_{\mu\nu}(\pi/2). \tag{5.32}
$$

The state considered in Refs. [40], [41], [42] was $|j0\rangle_z$, the state with equal photon numbers in both input ports. This state is the biggest contributor to the optimum states, so it is not unreasonable that this state should have a small phase uncertainty. This state suffers the drawback that it has equal peaks at 0 and $\pi$, and therefore must be considered modulo $\pi$ in order to obtain meaningful results. If we add the state $|j1\rangle_z$ (the next biggest contributor to the optimum state), then we obtain a state for which we can consider the phase modulo $2\pi$. This state, $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$, was
considered in Ref. fffi ]. 

For the state $|j0\rangle_z$, the only non-zero coefficient is $\psi_0 = 1$, so the inner product is

$$
\langle j\phi|j0\rangle_z = (2j+1)^{-1/2}\sum_{\mu=-j}^{j} e^{-i\mu(\phi+\pi/2)}\, I^j_{\mu 0}(\pi/2). \tag{5.33}
$$

The probability distribution is then given by

$$
P(\phi) = \frac{1}{2\pi}\sum_{\mu,\nu=-j}^{j} e^{-i(\mu-\nu)(\phi+\pi/2)}\left(I^j_{\nu 0}(\pi/2)\right)^* I^j_{\mu 0}(\pi/2). \tag{5.34}
$$

Because this state has equal peaks at 0 and $\pi$, the usual definition of the Holevo phase variance will give infinite results. Rather than using the usual definition, however, we can use a modified definition that is naturally modulo $\pi$. The definition that I will use is

$$
V_\pi(\phi) = \left(\left|\langle e^{2i\phi}\rangle\right|^{-2} - 1\right)/4. \tag{5.35}
$$
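To make the modified definition concrete, the following minimal sketch evaluates $(|\langle e^{2i\phi}\rangle|^{-2}-1)/4$ for a density given on $[-\pi,\pi)$. The function name `holevo_variance_mod_pi` and the test density $(1+\cos 2\phi)/2\pi$, which has equal peaks at 0 and $\pi$, are illustrative choices and do not come from the text.

```python
import cmath
import math


def holevo_variance_mod_pi(density, samples=20000):
    """(|<exp(2i phi)>|^-2 - 1)/4 for a density on [-pi, pi), midpoint rule."""
    step = 2 * math.pi / samples
    moment = 0j
    total = 0.0
    for k in range(samples):
        phi = -math.pi + (k + 0.5) * step
        p = density(phi)
        moment += p * cmath.exp(2j * phi) * step
        total += p * step
    moment /= total                  # tolerate an unnormalised density
    return (abs(moment) ** -2 - 1) / 4


def two_peaks(phi):
    """Illustrative density with equal peaks at phi = 0 and phi = pi."""
    return (1 + math.cos(2 * phi)) / (2 * math.pi)


variance = holevo_variance_mod_pi(two_peaks)
print(variance)
```

For this density $\langle e^{i\phi}\rangle$ vanishes, so the usual Holevo variance diverges, whereas the modulo-$\pi$ form remains finite.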



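From Eq. (5.34) the value of $|\langle e^{2i\phi}\rangle|$ can be determined as

$$
\begin{aligned}
\left|\langle e^{2i\phi}\rangle\right| &= \left|\frac{1}{2\pi}\int_{-\pi}^{\pi}\sum_{\mu,\nu=-j}^{j} e^{2i\phi}\, e^{-i(\mu-\nu)(\phi+\pi/2)}\left(I^j_{\nu 0}(\pi/2)\right)^* I^j_{\mu 0}(\pi/2)\, d\phi\right| \\
&= \left|\sum_{\mu,\nu=-j}^{j}\delta_{\nu,\mu-2}\left(I^j_{\nu 0}(\pi/2)\right)^* I^j_{\mu 0}(\pi/2)\right| \\
&= \left|\sum_{\mu=2-j}^{j}\left(I^j_{\mu-2,0}(\pi/2)\right)^* I^j_{\mu 0}(\pi/2)\right|.
\end{aligned} \tag{5.36}
$$

This can be used in Eq. (5.35) to determine the phase variance under this modified definition. Similarly, for the state $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$, we have $\psi_0 = \psi_1 = 1/\sqrt{2}$, so the inner product is

$$
\langle j\phi|\left(\frac{|j0\rangle_z + |j1\rangle_z}{\sqrt{2}}\right) = \frac{1}{\sqrt{2(2j+1)}}\sum_{\mu=-j}^{j} e^{-i\mu(\phi+\pi/2)}\left[I^j_{\mu 0}(\pi/2) + i\, I^j_{\mu 1}(\pi/2)\right]. \tag{5.37}
$$

The probability distribution is therefore

(5.38)

Using this expression for the probability distribution we find

(5.39)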



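This result was used to determine the Holevo phase variance for this state. The third state that will be considered that does not have a simple solution for the Holevo phase variance is that with all photons in one port, $|jj\rangle_z$. For this state the only non-zero coefficient is $\psi_j = 1$, so

$$
\langle j\phi|jj\rangle_z = (2j+1)^{-1/2}\sum_{\mu=-j}^{j} e^{-i\mu\phi}\, e^{i(\pi/2)(j-\mu)}\, I^j_{\mu j}(\pi/2). \tag{5.40}
$$

The probability distribution is therefore

$$
P(\phi) = \frac{1}{2\pi}\sum_{\mu,\nu=-j}^{j} e^{-i(\mu-\nu)(\phi+\pi/2)}\left(I^j_{\nu j}(\pi/2)\right)^* I^j_{\mu j}(\pi/2), \tag{5.41}
$$

and the value of $|\langle e^{i\phi}\rangle|$ is

$$
\left|\langle e^{i\phi}\rangle\right| = \left|\sum_{\mu=1-j}^{j}\left(I^j_{\mu-1,j}(\pi/2)\right)^* I^j_{\mu j}(\pi/2)\right|. \tag{5.42}
$$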



The Holevo phase variance was calculated for the above three states using these expressions, for photon numbers up to 25600, and the results are shown in Fig. 5.5. I have also included the analytic expression for the phase variance for optimised states.

The phase variance for $|j0\rangle_z$ scales down with photon number much more slowly than the phase variance for optimal states, and in fact even more slowly than the phase variance for $|jj\rangle_z$, which scales as $N^{-1}$. In fact, for the range of photon numbers considered the phase variance scales as $N^{-1/2}$. This would seem to imply a phase uncertainty scaling as $N^{-1/4}$, in dramatic contrast to the $N^{-1}$ scaling found in [40], [41], [42]. The state $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$ is even worse, with a phase variance that does not scale down with photon number at all, and in fact never falls below 1.

Another unusual feature of the graph is that the phase variance for $|j0\rangle_z$ is even smaller than that for optimum states for very small photon numbers. This is not in fact a contradiction, because different measures of the uncertainty are used for the two states. For very small photon numbers, the phase probability distribution for optimum states is significant for phases beyond $\pm\pi/2$. The measure in Eq. (5.35) that is used for the phase variance for $|j0\rangle_z$ effectively ignores the distribution beyond $\pm\pi/2$, so under this measure, this state has a correspondingly lower phase variance.

Figure 5.5: The canonical Holevo phase variance versus input photon number $2j$. The continuous line is for optimum states $|\psi_{\rm opt}\rangle$, the dashed line is for all photons in one input port $|jj\rangle_z$, the dotted line is for equal photon numbers in both ports $|j0\rangle_z$, and the dash-dotted line is for the state $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$.



The reason for the discrepancies between the results obtained here for the states $|j0\rangle_z$ and $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$, and those obtained in the references cited above, is that those results are all based on the width of the central peak in the distribution. In contrast, the Holevo phase variance for these states is primarily due to the tails. To demonstrate this for the state $|j0\rangle_z$, in Fig. 5.6 I have plotted the phase distribution multiplied by $\sin^2\phi$.

The reason for multiplying by a factor of $\sin^2\phi$ is that

$$
\begin{aligned}
V_\pi(\phi) &= \left(\left|\langle e^{2i\phi}\rangle\right|^{-2} - 1\right)/4 \\
&= \left(\langle\cos 2\phi\rangle^{-2} - 1\right)/4 \\
&= \left(\langle 1 - 2\sin^2\phi\rangle^{-2} - 1\right)/4 \\
&= \left[\left(1 - 2\langle\sin^2\phi\rangle\right)^{-2} - 1\right]/4 \\
&\approx \left[\left(1 + 4\langle\sin^2\phi\rangle\right) - 1\right]/4 \\
&= \langle\sin^2\phi\rangle.
\end{aligned} \tag{5.43}
$$

The above approximation is accurate for small phase variance. This derivation also uses the fact that the phase distribution for this state is unbiased, so $\langle e^{2i\phi}\rangle$ is real. Note that in this form the variance is very similar to the standard variance, $\langle\phi^2\rangle$. Since $\phi^2 \geq \sin^2\phi$, the tails of the distribution are even more significant for the standard variance, as illustrated in Fig. 5.7.
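The quality of the small-variance approximation can be checked numerically. The sketch below uses an illustrative family of symmetric densities, $P(\phi)\propto\exp(\kappa\cos 2\phi)$ on $[-\pi/2,\pi/2]$, which is an assumption made purely for this example; the function name `compare` and the parameter `kappa` are hypothetical.

```python
import math


def compare(kappa, samples=20000):
    """Return (V_pi, <sin^2 phi>) for P ~ exp(kappa*cos(2 phi)) on [-pi/2, pi/2]."""
    step = math.pi / samples
    norm = cos2 = sin2 = 0.0
    for k in range(samples):
        phi = -math.pi / 2 + (k + 0.5) * step
        # shifted exponent keeps the weight in (0, 1] and avoids overflow
        w = math.exp(kappa * (math.cos(2 * phi) - 1))
        norm += w
        cos2 += w * math.cos(2 * phi)
        sin2 += w * math.sin(phi) ** 2
    cos2 /= norm
    sin2 /= norm
    # the density is symmetric, so <exp(2i phi)> = <cos 2 phi> is real
    return (cos2 ** -2 - 1) / 4, sin2


narrow_v, narrow_s = compare(50.0)   # sharply peaked
broad_v, broad_s = compare(1.0)      # broad
print(narrow_v, narrow_s)
print(broad_v, broad_s)
```

For the sharply peaked case the two numbers agree to within a few per cent, while for the broad case the exact expression is far larger than $\langle\sin^2\phi\rangle$, as expected from the expansion.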

There is a similar problem for the state $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$. For this state there are peaks at $\pm\pi$, as shown in Fig. 5.8. Although these peaks are smaller than the main peak at 0, they do not get smaller with photon number. This means that the Holevo phase variance is almost entirely due to these peaks, and therefore does not decrease with photon number. As this state is so poor, I will not consider it further, and restrict attention to $|j0\rangle_z$ and the optimum states.

Figure 5.7: The canonical phase probability distribution, multiplied by $\phi^2$, for the state $|j0\rangle_z$ with $2j = 80$ photons.

Figure 5.8: The canonical phase probability distribution for $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$ for $2j = 80$ photons.



The asymptotic expression for the phase probability distribution for $|j0\rangle_z$ given in [42] is

$$
P(\phi) \approx \frac{2j+1}{2\pi}\,\frac{[\Gamma(3/4)]^2}{2^{3/2}}\,\frac{[J_{1/4}(j\phi)]^2}{\sqrt{j\phi}}. \tag{5.44}
$$

This equation is approximately half what it should be in order to be normalised. The reason for this is that it approximates the exact distribution over the interval $[-\pi/2, \pi/2]$, but the exact distribution is normalised over $[-\pi, \pi]$, and repeats modulo $\pi$. Since Eq. (5.44) only approximates the distribution over the region $[-\pi/2, \pi/2]$, its integral over this region will be approximately $\frac{1}{2}$. Therefore the expression that I will be using is

$$
P(\phi) \approx \frac{2j+1}{2\pi}\,\frac{[\Gamma(3/4)]^2}{\sqrt{2}}\,\frac{[J_{1/4}(j|\phi|)]^2}{\sqrt{j|\phi|}}. \tag{5.45}
$$

This expression is correctly normalised in the limit as $j$ goes to infinity. I have also added absolute value signs so that this expression is correct for negative values of $\phi$.
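The normalisation of the doubled expression can be checked in closed form. The following is a sketch rather than part of the original argument: it assumes the large-$j$ replacement $2j+1 \to 2j$, extends the integration limits to infinity, and uses the standard integral $\int_0^\infty [J_\nu(x)]^2 x^{-\lambda}\,dx = \Gamma(\lambda)\Gamma(\nu+\tfrac{1-\lambda}{2}) / \{2^\lambda[\Gamma(\tfrac{1+\lambda}{2})]^2\Gamma(\nu+\tfrac{1+\lambda}{2})\}$ (valid for $0<\lambda<2\nu+1$), which is not quoted in the text.

```latex
\begin{align*}
\int_{-\infty}^{\infty}\frac{j}{\pi}\,\frac{[\Gamma(3/4)]^2}{\sqrt{2}}\,
  \frac{[J_{1/4}(j|\phi|)]^2}{\sqrt{j|\phi|}}\,d\phi
 &= \frac{2[\Gamma(3/4)]^2}{\pi\sqrt{2}}\int_0^\infty\frac{[J_{1/4}(x)]^2}{\sqrt{x}}\,dx,
 \qquad x = j\phi, \\
\int_0^\infty\frac{[J_{1/4}(x)]^2}{\sqrt{x}}\,dx
 &= \frac{\Gamma(1/2)\,\Gamma(1/2)}{\sqrt{2}\,[\Gamma(3/4)]^2\,\Gamma(1)}
  = \frac{\pi}{\sqrt{2}\,[\Gamma(3/4)]^2}, \\
\Rightarrow\quad
\int_{-\infty}^{\infty}P(\phi)\,d\phi
 &\approx \frac{2[\Gamma(3/4)]^2}{\pi\sqrt{2}}\cdot\frac{\pi}{\sqrt{2}\,[\Gamma(3/4)]^2} = 1 .
\end{align*}
```

With the extra factor of $2^{-1}$ in the undoubled form the same calculation gives $\tfrac12$, consistent with the statement that the original expression is about half of what is needed for normalisation.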
Now for large jcf> there is the approximate proportionality pq| 



This implies that for large 



This means that we should have 



2 I / |l/2 
OC \0 . 



(5.46) 
(5.47) 
(5.48) 



It can be seen in Fig. 5.7 that the scaling is nothing like this, and is closer to 



PjWtf oc |0| 



(5.49)

Figure 5.9: The canonical phase probability distribution envelope, multiplied by $\sin^2\phi$, for the state $|j0\rangle_z$ with $2j = 51200$ photons, calculated using three different methods. The exact calculation is indicated by the continuous line, the first approximation (5.51) is shown as the dashed line, and the Bessel function approximation (5.45) is shown as the dotted line.

To see where this discrepancy originates, consider the intermediate approximation made in Ref. [42]:



(-1) J/2 



3/2 



-2ifj,(f> 



^/ 2 Wi + l)-(2M) : 
Using this approximation the probability distribution is 
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(5.50) 
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V — 

M i^ /2 b'(i + i)-(2M) 



2ll/4 



(5.51) 



Here I have multiplied by a factor of 2 so the probability distribution is normalised over the interval $[-\pi/2, \pi/2]$, for consistency with Eq. (5.45). In Ref. [42] the sum is then approximated by an integral in order to obtain the Bessel function approximation.

In order to see where the approximation is deviating from the exact expression, in Fig. 5.9 I have plotted the probability distribution multiplied by $\phi^2$ for the exact expression, the approximation (5.51), and the Bessel function approximation (5.45). For the exact expression I have used twice Eq. (5.34), so that the distribution is normalised over the interval $[-\pi/2, \pi/2]$. The functions are very rapidly oscillating, so to make the three curves legible only the peaks are plotted.

Near $\phi = 0$ the three expressions for the distribution give very similar results, but there are large differences in the tails. As can be seen in the figure, the first approximation (5.51) has tails that are fairly close to the exact expression, but still noticeably higher. In contrast, the tails for the Bessel function approximation are much different, and lower than the exact expression. These results indicate that it is not primarily the initial approximation for $I^j_{\mu 0}(\pi/2)$ that is giving the incorrect scaling for the tails, but the approximation where the sum is approximated by an integral.

On the other hand, if we look at the results close to the centre of the distribution, we find that there is very good agreement between the curves. To illustrate this, the distribution near the centre multiplied by $\sin^2\phi$ is plotted in Fig. 5.10. For large photon numbers there is good agreement over a region that is large compared to the central peak, but the agreement is always poor for phases that are significant compared to $\pi/2$.

Figure 5.10: The canonical phase probability distribution, multiplied by $\sin^2\phi$, for the state $|j0\rangle_z$ with $2j = 51200$ photons, calculated using two different methods and restricted to a range of small $\phi$. The exact calculation is indicated by the continuous line, and the Bessel function approximation (5.45) is shown as the crosses.

In the light of these results, any results based on the Bessel function approximation should be treated very carefully if they depend on the distribution for large phases. For example, the Bessel function approximation can be used to show that the Holevo phase variance should scale as $N^{-1/2}$ (as stated in [54]), but it does not give the correct scaling constant. From Eq. (5.43) the Holevo phase variance is given approximately by

$$
\langle\sin^2\phi\rangle = \int_{-\pi/2}^{\pi/2} P(\phi)\sin^2\phi\, d\phi. \tag{5.52}
$$

Note that this approximation is accurate provided that $\langle\sin^2\phi\rangle$ is small, but does not depend on the main part of the contribution being from small $\phi$. Using the Bessel function approximation (5.45), we find that

$$
\langle\sin^2\phi\rangle \approx \frac{2j+1}{\pi}\,\frac{[\Gamma(3/4)]^2}{\sqrt{2}}\int_0^{\pi/2}\frac{[J_{1/4}(j\phi)]^2}{\sqrt{j\phi}}\sin^2\phi\, d\phi. \tag{5.53}
$$
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From Ref. p6[, for large cj> the Bessel function can be approximated by 



$$
J_{1/4}(j\phi) \approx \sqrt{\frac{2}{\pi j\phi}}\,\sin(j\phi + \pi/8). \tag{5.54}
$$



Therefore, since the majority of the contribution to the phase variance is from large $\phi$, we have

$$
\langle\sin^2\phi\rangle \approx \frac{2j+1}{\pi}\,\frac{[\Gamma(3/4)]^2}{\sqrt{2}}\int_0^{\pi/2}\frac{2\sin^2(j\phi + \pi/8)}{\pi (j\phi)^{3/2}}\sin^2\phi\, d\phi. \tag{5.55}
$$

Using the average value of $\sin^2(j\phi + \pi/8)$ and taking the limit of large $j$, this becomes

$$
\langle\sin^2\phi\rangle \approx \frac{2[\Gamma(3/4)]^2}{\pi^2\sqrt{N}}\int_0^{\pi/2}\frac{\sin^2\phi}{\phi^{3/2}}\, d\phi. \tag{5.56}
$$

This demonstrates that the Bessel function approximation predicts that the Holevo phase variance should scale as $N^{-1/2}$. The scaling constant is given by

$$
\frac{2[\Gamma(3/4)]^2}{\pi^2}\int_0^{\pi/2}\frac{\sin^2\phi}{\phi^{3/2}}\, d\phi
 = \frac{2\sqrt{2}\,[\Gamma(3/4)]^2}{\pi^{5/2}}\sum_{n=1}^{\infty}\frac{(-1)^{n-1}\pi^{2n}}{(2n)!\,(4n-1)}
 = 0.2845775946062444\ldots \tag{5.57}
$$

The above expression is easily evaluated by numerical evaluation of the integral, or by summing the first dozen or so terms of the sum. The scaling constant obtained is about 0.28; however, from the numerical results shown in Fig. , the actual scaling constant is 0.44.

Note that for the standard variance, $\langle\phi^2\rangle$, we are simply replacing $\sin^2\phi$ with $\phi^2$, so the scaling is again $N^{-1/2}$, except with a scaling constant of

$$
\frac{2[\Gamma(3/4)]^2}{\pi^2}\int_0^{\pi/2}\phi^{1/2}\, d\phi = \frac{\sqrt{2}\,[\Gamma(3/4)]^2}{3\sqrt{\pi}} = 0.3993800782451976\ldots \tag{5.58}
$$

This scaling constant of about 0.40 is again different from the actual scaling constant of about 0.66 (the results demonstrating this scaling constant will be discussed below). It is significant that the scaling constants for the Holevo phase variance and the standard phase variance are different, because it demonstrates that the Holevo phase variance is not necessarily the same as the standard phase variance, even in the limit of very sharply peaked distributions. For the Holevo phase variance to be the same as the standard variance, the phase distribution must not only be narrowly peaked, but the tails must scale down rapidly enough that there is no significant contribution to the phase variance from large $\phi$.
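The two constants predicted by the Bessel function approximation can be reproduced in a few lines. The sketch below assumes the integral form $(2[\Gamma(3/4)]^2/\pi^2)\int_0^{\pi/2}g(\phi)\,\phi^{-3/2}\,d\phi$, with $g=\sin^2\phi$ for the Holevo variance and $g=\phi^2$ for the standard variance, and expands $\sin^2\phi$ in its power series before integrating term by term. The function names are hypothetical.

```python
import math

G = math.gamma(0.75)


def holevo_constant(terms=40):
    """Series form of (2 G^2/pi^2) * integral_0^{pi/2} sin^2(phi)/phi^(3/2) dphi."""
    series = sum((-1) ** (n - 1) * math.pi ** (2 * n)
                 / (math.factorial(2 * n) * (4 * n - 1))
                 for n in range(1, terms + 1))
    return 2 * math.sqrt(2) * G ** 2 / math.pi ** 2.5 * series


def standard_constant():
    """Closed form of (2 G^2/pi^2) * integral_0^{pi/2} phi^(1/2) dphi."""
    return math.sqrt(2) * G ** 2 / (3 * math.sqrt(math.pi))


print(holevo_constant())     # compare with the value quoted in Eq. (5.57)
print(standard_constant())   # compare with the value quoted in Eq. (5.58)
```

The series converges quickly, so a dozen terms already give many significant digits, in line with the remark above.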



5.4 Other Measures of the Phase Uncertainty 

In practice the high tails of the $|j0\rangle_z$ state mean that although most of the results of phase measurements will have small errors, scaling as $N^{-1}$, there will always be a significant number of results with large phase errors. This means that we would need to be very careful analysing results obtained from this state. For example, if we take the mean of a large number of results obtained using this state, the error in the mean will scale as $N^{-1/4}$ rather than $N^{-1}$. In order to obtain results with error scaling as $N^{-1}$, we would need to use some more sophisticated data analysis technique. In contrast, because the optimum states derived here have a Holevo phase variance that scales as $N^{-2}$, we can use all the standard data analysis techniques and still get an error scaling as $N^{-1}$.

Another issue is that although the phase uncertainty for $|j0\rangle_z$ as indicated by the Holevo or standard phase variance does not scale as $N^{-1}$, it does scale as $N^{-1}$ under other measures of uncertainty. The phase variance is a very stringent measure of uncertainty, and generally gives an
upper limit on other measures of uncertainty. For example, there are the inequalities |5q, pq] 



A(f> 

> L C , 



VT^C 



(5.59) 



where $\Delta\phi$ is the square root of the variance, $L_H$ is the entropic length, $L_F$ is the Fisher length, and $L_C$ is a $C \times 100\%$ confidence interval. These inequalities mean that if $\Delta\phi$ scales as $N^{-1}$, as is the case for optimum states, then the entropic length, Fisher length, and confidence intervals must also scale at least as $N^{-1}$.

Two other measures of the phase uncertainty are the reciprocal-of-peak-value, $L_{\rm rp}$, and the Süssman measure, $L_S$. Specifically, the definitions for each of these measures are



$$
\begin{aligned}
\Delta\phi^2 &= \int_{-\pi}^{\pi} P(\phi)(\phi - \bar\phi)^2\, d\phi \\
\log L_H &= -\int_{-\pi}^{\pi} P(\phi)\log P(\phi)\, d\phi \\
L_F^{-2} &= \int_{-\pi}^{\pi}\frac{1}{P(\phi)}\left[\frac{dP(\phi)}{d\phi}\right]^2 d\phi \\
1 - C &= \int_{\bar\phi - L_C}^{\bar\phi + L_C} P(\phi)\, d\phi \\
L_{\rm rp}^{-1} &= \max_\phi P(\phi) \\
L_S^{-1} &= \int_{-\pi}^{\pi} [P(\phi)]^2\, d\phi,
\end{aligned} \tag{5.60}
$$

where $\bar\phi$ is the mean phase, defined as

$$
\bar\phi = \int_{-\pi}^{\pi} P(\phi)\,\phi\, d\phi. \tag{5.61}
$$

In addition, there is the usual Holevo phase variance

$$
V(\phi) = \left|\langle e^{i\phi}\rangle\right|^{-2} - 1. \tag{5.62}
$$

Most of these measures are discussed in more detail in Ref. [57].

In order to calculate these measures, the full probability distribution for the measurement scheme is required. In the case of the optimum input state, the inner product is given by

Expressing the sine in terms of exponentials gives a sum that can be evaluated:

Using the summation formula

this simplifies to

The probability distribution is therefore

Simplifying this we find

$$
P(\phi) = \frac{1}{2\pi}\,\frac{1}{N+2}\,\frac{\sin^2\left(\frac{\pi}{N+2}\right)\left(1 + \cos[\phi(N+2)]\right)}{\left[\cos\phi - \cos\left(\frac{\pi}{N+2}\right)\right]^2}. \tag{5.68}
$$

This simple result for the probability distribution can be used to determine the phase uncertainty under each of the measures in (5.60) using numerical integrals.

For the state with equal numbers of photons in each port, the probability distribution is given by Eq. (5.34). This expression cannot be evaluated to a simple expression like Eq. (5.68), and the full sum must be used in order to treat the distribution exactly. This becomes prohibitively time consuming if the integrals must be performed numerically; however, all of the integrals can be evaluated using sums, except for the entropic length. To see this, note that the probability distribution can be expressed as

where

Recall that the modified Holevo phase variance for this state can be evaluated using (5.35) and (5.36). For the other measures of the phase uncertainty the integrals will be performed over the interval $[-\pi/2, \pi/2]$, so the above distribution must be multiplied by a factor of 2 to be correctly normalised. The modified probability distribution is therefore

In addition, it can be shown that all the $P_k$ are real, and $P_k = 0$ for odd $k$. For the standard phase variance, it is simpler to express the probability distribution as

The standard phase variance can then be determined using

Evaluating this we find that

This provides a simple formula to determine the standard phase variance, once the coefficients $P_k$ have been determined.

Next, to determine the Fisher information, note that for the state with equal photon numbers 
in both ports, the probability distribution can be expressed as 

P{4 > ) = ^{4 > ), 



with 



(5.75) 
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Here the factor of V ensures that this is always real, so the absolute value sign is not required. Note 
also that a factor of (rather than is required for the probability distribution to be 

normalised over the interval [— 7r/2, ir/2}. In terms of this function the Fisher information is 
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Evaluating this gives 
In the last line the result that $I^j_{-\mu,0}(\pi/2) = (-1)^\mu I^j_{\mu 0}(\pi/2)$ has been used. This expression gives a simple method for calculating the Fisher length.

Next, in order to evaluate the confidence interval we wish to perform the integral

$$
1 - C = \int_{-L_C}^{L_C} P(\phi)\, d\phi. \tag{5.79}
$$

This is easily evaluated as

$$
\begin{aligned}
1 - C &= \int_{-L_C}^{L_C}\left[\frac{P_0}{\pi} + \frac{2}{\pi}\sum_{k=2}^{2j} P_k\cos(k\phi)\right] d\phi \\
&= \frac{2 L_C P_0}{\pi} + \frac{4}{\pi}\sum_{k=2}^{2j}\frac{P_k}{k}\sin(k L_C).
\end{aligned} \tag{5.80}
$$

This expression can then be used to find $L_C$ for a given $C$ numerically.

Figure 5.11: The phase uncertainty of the $|j0\rangle_z$ state (purple) and states optimised for minimum Holevo phase variance (black) under several measures multiplied by $N$. The square root of the Holevo phase variance is shown as the continuous lines, the square root of the standard variance as the circles, the inverse-of-maximal-value as the triangles, the Sussman measure as the dotted lines, the entropic length as the dash-dotted lines, the Fisher length as the crosses, and the 67% confidence interval is shown as the pluses.

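The evaluation of the reciprocal-of-peak-value is simple, as it merely requires the evaluation of the probability distribution at $\phi = 0$. Lastly, the Sussman measure can be evaluated using

$$
\begin{aligned}
L_S^{-1} &= \int_{-\pi/2}^{\pi/2}\frac{1}{\pi^2}\sum_{k,k'=-2j}^{2j} P_k P_{k'}\, e^{i(k+k')\phi}\, d\phi \\
&= \frac{1}{\pi}\sum_{k=-2j}^{2j} P_k^2.
\end{aligned} \tag{5.81}
$$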



These methods were used to calculate each of the above measures of uncertainty for the optimum input state and $|j0\rangle_z$ for a large range of photon numbers, and the results multiplied by $N$ are plotted in Fig. 5.11. It is clear from this plot that the asymptotic values for most of these measures are good approximations for photon numbers of order 100 or greater. The results for the Holevo and standard variance for $|j0\rangle_z$ do not converge, as these measures of the phase uncertainty scale as $N^{-1/4}$. The results for the entropic length for $|j0\rangle_z$ do not appear to converge at this scale; however, if we increase the scale of the plot, as in Fig. 5.12, we find that the results converge to a large value for very large photon number.

It can be seen from Fig. 5.11 that the optimal state is actually worse than the $|j0\rangle_z$ state as evaluated using the reciprocal peak $L_{\rm rp}$ or the Fisher length $L_F$, but it is better under all the other measures of phase uncertainty. The Sussman measure and the entropic length are smaller for $|j0\rangle_z$ for smaller photon numbers, but the asymptotic values are larger.

Figure 5.12: The entropic length for the $|j0\rangle_z$ state multiplied by $N$ on a larger scale.

It can be argued that the reciprocal peak and Fisher length do not give very meaningful estimates of the phase uncertainty in the case of $|j0\rangle_z$. The reciprocal peak gives good scaling because the probability distribution has a high peak, but it does not take the tails into account at all. As an extreme example of where this happens, consider the probability distribution given by

$$
P(\phi) = A\,\delta(\phi) + \frac{1 - A}{2\pi}, \tag{5.82}
$$

where $A \ll 1$. Under the reciprocal peak measure, the phase uncertainty is zero; however, it is clear that the actual phase uncertainty is very large.

The Fisher length is small when the probability distribution has a large first derivative. This means that the Fisher length gives a meaningful estimate of the uncertainty when the probability distribution has a smooth peak, as the larger the derivative is, the sharper the peak. The Fisher length is not so meaningful when the probability distribution oscillates, as is the case for $|j0\rangle_z$. The small Fisher length is then just due to the rapid oscillations, rather than a single narrow peak. As an extreme example of this, consider

$$
P(\phi) \propto \cos^2(n\phi), \tag{5.83}
$$

where $n \gg 1$. The Fisher length will be very small for this distribution; however, it is clear that the actual phase uncertainty is very large. In view of these considerations, it appears that the reciprocal peak and Fisher length give misleadingly small estimates of the phase uncertainty for the $|j0\rangle_z$ state. The better scaling using these measures should not be taken to imply that $|j0\rangle_z$ states are better than the optimal states.
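The oscillating example can be made quantitative with a short numerical sketch. The normalisation $P(\phi) = (2/\pi)\cos^2(n\phi)$ on $[-\pi/2,\pi/2]$ is an assumption made for this illustration, and the Fisher information is evaluated as $4\int[\partial_\phi\sqrt{P}]^2d\phi$ using the signed amplitude $\sqrt{2/\pi}\cos(n\phi)$, which avoids dividing by $P$ at its zeros. The function name `fisher_length_and_spread` is hypothetical.

```python
import math


def fisher_length_and_spread(n, samples=200000):
    """Fisher length and standard deviation for P = (2/pi) cos^2(n phi)."""
    step = math.pi / samples
    fisher = variance = 0.0
    for k in range(samples):
        phi = -math.pi / 2 + (k + 0.5) * step
        # derivative of the signed amplitude sqrt(2/pi) * cos(n phi)
        amp_slope = -n * math.sqrt(2 / math.pi) * math.sin(n * phi)
        fisher += 4 * amp_slope ** 2 * step
        variance += phi ** 2 * (2 / math.pi) * math.cos(n * phi) ** 2 * step
    # the mean phase is zero by symmetry, so the second moment is the variance
    return fisher ** -0.5, math.sqrt(variance)


fisher_length, spread = fisher_length_and_spread(50)
print(fisher_length, spread)
```

For $n = 50$ the Fisher length is of order $1/n$, while the standard deviation stays close to that of a flat distribution on the same interval, which is the mismatch described above.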



Using the results plotted in Fig. 5.11 we can determine the scaling constants for each of these measures for the two states, and the results are listed in Table 5.1. In this table $\Delta\phi_H$ denotes the square root of the Holevo phase variance. Note that for $|\psi_{\rm opt}\rangle$ the square roots of both the standard variance and the Holevo variance scale as $\pi/N$, in agreement with the scaling predicted analytically. For $|j0\rangle_z$, the standard and Holevo variance scale as

$$
\Delta\phi^2 \approx \frac{0.6573863}{\sqrt{N}}, \tag{5.84}
$$

$$
V(\phi) \approx \frac{0.4395}{\sqrt{N}}. \tag{5.85}
$$

These are different from each other, and also from the values obtained previously using the Bessel function approximation.

Table 5.1: The scaling constants for each of the measures of phase uncertainty for the state optimised for minimum Holevo phase variance and $|j0\rangle_z$.

Measure              $|\psi_{\rm opt}\rangle$    $|j0\rangle_z$
$N\Delta\phi$        3.14159265359               $0.8107937 \times N^{3/4}$
$N\Delta\phi_H$      3.1415927                   $0.66292 \times N^{3/4}$
$NL_{\rm rp}$        7.7515691701                6.87519
$NL_S$               10.710529485                12.30505
$NL_H$               12.414819836                35.79
$NL_{2/3}$           2.9481552495                3.07129
$NL_F$               2.76615948                  1.4142136

It is interesting to note that the coefficients found here for the confidence interval (3.071) and the Fisher length (1.414) for the state $|j0\rangle_z$ differ from those found in [42], of 3.36 and 2 respectively.

In order to see the reason for this difference we can use the asymptotic approximation (5.45). In the large $j$ limit the $+1$ can be ignored, so this becomes

$$
P(\phi) \approx \frac{j\,\Gamma^2(3/4)}{\pi\sqrt{2}}\,\frac{[J_{1/4}(j|\phi|)]^2}{\sqrt{j|\phi|}}. \tag{5.86}
$$

Therefore the 2/3 confidence interval can be determined as

$$
\frac{1}{3} = \int_0^{L_{2/3}}\frac{j\,\Gamma^2(3/4)}{\pi\sqrt{2}}\,\frac{J_{1/4}^2(j\phi)}{\sqrt{j\phi}}\, d\phi
 = \frac{1}{\pi}\,\frac{\Gamma^2(3/4)}{\sqrt{2}}\int_0^{jL_{2/3}}\frac{J_{1/4}^2(x)}{\sqrt{x}}\, dx, \tag{5.87}
$$

where $x = j\phi$. Rather than using the asymptotic approximation of the Bessel function, as used in [42], it is fairly straightforward to numerically evaluate the integral and use Newton's method to find $jL_{2/3}$. This gives $jL_{2/3} \approx 1.53564794820384$, so $NL_{2/3} \approx 3.07129589640767$. This agrees with the value found from the exact calculations to six significant digits. Thus we see that the difference in the scaling constant is due to the asymptotic approximation for the Bessel function used in [42].
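The root-finding step described above can be sketched as follows. This is only an illustration, not the original calculation: the names `density`, `cumulative` and `solve_half_width` are hypothetical, the Bessel function is evaluated from its power series (adequate only for the small arguments needed here), the integral is done with Simpson's rule, and the target value $1/3$ is the left-hand side of Eq. (5.87).

```python
import math

G34 = math.gamma(0.75)
G54 = math.gamma(1.25)


def density(x, terms=40):
    """Scaled density (G34^2/(pi*sqrt 2)) * J_{1/4}(x)^2 / sqrt(x)."""
    # x^(-1/4) J_{1/4}(x) = 2^(-1/4) * sum_k (-1)^k (x/2)^(2k) / (k! Gamma(k + 5/4))
    series, term = 0.0, 1.0 / G54
    for k in range(terms):
        series += term
        term *= -(x / 2) ** 2 / ((k + 1) * (k + 1.25))
    # J^2 / sqrt(x) = series^2 / sqrt(2), which is finite at x = 0
    return G34 ** 2 / (math.pi * math.sqrt(2)) * series ** 2 / math.sqrt(2)


def cumulative(x, panels=200):
    """Simpson's rule for the integral of density from 0 to x."""
    h = x / panels
    total = density(0.0) + density(x)
    for k in range(1, panels):
        total += (4 if k % 2 else 2) * density(k * h)
    return total * h / 3


def solve_half_width(target=1 / 3, guess=1.5):
    """Newton iteration for cumulative(x) = target; the slope is density(x)."""
    x = guess
    for _ in range(20):
        x -= (cumulative(x) - target) / density(x)
    return x


half_width = solve_half_width()      # estimate of j * L_{2/3}
print(half_width, 2 * half_width)    # the second number estimates N * L_{2/3}
```

Because the derivative of the cumulative integral is the density itself, each Newton step needs only one extra function evaluation.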
For the Fisher length, if we perform the derivation in the same way as in [42], we find

$$
\frac{1}{P(\phi)}\left[\frac{dP(\phi)}{d\phi}\right]^2 = \frac{4j\,\Gamma^2(3/4)}{\pi\sqrt{2}}\left[\frac{d}{d\phi}\frac{J_{1/4}(j\phi)}{(j\phi)^{1/4}}\right]^2. \tag{5.88}
$$

So the Fisher information is

$$
\begin{aligned}
F &= \frac{4j\,\Gamma^2(3/4)}{\pi\sqrt{2}}\int\left[\frac{d}{d\phi}\frac{J_{1/4}(j\phi)}{(j\phi)^{1/4}}\right]^2 d\phi \\
&= 8j^2\,\frac{\Gamma^2(3/4)}{\pi\sqrt{2}}\int_0^\infty\frac{[J_{5/4}(x)]^2}{\sqrt{x}}\, dx \\
&= 8j^2\,\frac{\Gamma^2(3/4)}{\pi\sqrt{2}}\,\frac{\pi}{\sqrt{8}\,\Gamma^2(3/4)} \\
&= 2j^2.
\end{aligned} \tag{5.89}
$$

This is twice what was found in [42], and using this the Fisher length is approximately $L_F \approx \sqrt{2}/N$. The reason for this factor of two is that the approximation used for the probability distribution in [42] is not correctly normalised. Here the normalised version has been used, which is twice that given in [42].

Table 5.2: The scaling constants for each of the measures of phase uncertainty for the state optimised for minimum Holevo phase variance and $|j0\rangle_z$ as determined using the asymptotic approximations.

Measure              $|\psi_{\rm opt}\rangle$               $|j0\rangle_z$
$N\Delta\phi$        $\pi = 3.14159265358979\ldots$         $0.631965250820959 \times N^{3/4}$
$N\Delta\phi_H$      $\pi = 3.14159265358979\ldots$         $0.533458147005221 \times N^{3/4}$
$NL_{\rm rp}$        $\pi^3/4 = 7.75156917007495\ldots$     $4\pi[\Gamma(5/4)/\Gamma(3/4)]^2 = 6.87518581802037\ldots$
$NL_S$               10.7105294850660                       12.305050002393
$NL_H$               12.4148198362985                       35.78817
$NL_{2/3}$           2.94815524951393                       3.07129589640767
$NL_F$               2.7661594839                           $\sqrt{2} = 1.41421356237309\ldots$

Note that this approximate analytic result agrees very precisely with the result found using the exact calculations. We can also determine the scaling constants for the other measures of phase variance using the analytic approximation (5.86). The scaling constants for the standard and Holevo phase variances were found above. The reciprocal peak can be evaluated as

$$
L_{\rm rp} = \frac{\pi\sqrt{2}}{j\,\Gamma^2(3/4)}\lim_{\phi\to 0}\frac{\sqrt{j\phi}}{J_{1/4}^2(j\phi)}
 = \frac{2\pi}{j}\left[\frac{\Gamma(5/4)}{\Gamma(3/4)}\right]^2. \tag{5.90}
$$

The scaling is therefore

$$
L_{\rm rp} \approx \frac{4\pi}{N}\left[\frac{\Gamma(5/4)}{\Gamma(3/4)}\right]^2. \tag{5.91}
$$

This scaling constant is approximately 6.87518581802037, agreeing very accurately with that found from the exact calculations.
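The closed-form constants for $|j0\rangle_z$ are easy to check numerically. The sketch below evaluates the reciprocal-peak constant of Eq. (5.91) and the Fisher-length constant, the latter under the assumption that the Fisher information is $8j^2[\Gamma^2(3/4)/(\pi\sqrt2)]\int_0^\infty J_{5/4}^2(x)x^{-1/2}dx$ and that this integral is given by the standard formula $\Gamma(\lambda)\Gamma(\nu+\tfrac{1-\lambda}{2})/\{2^\lambda[\Gamma(\tfrac{1+\lambda}{2})]^2\Gamma(\nu+\tfrac{1+\lambda}{2})\}$ with $\nu=5/4$, $\lambda=1/2$. The variable names are illustrative.

```python
import math

g34 = math.gamma(0.75)
g54 = math.gamma(1.25)

# Reciprocal-of-peak constant for |j0>_z: N * L_rp = 4*pi*[Gamma(5/4)/Gamma(3/4)]^2
n_l_rp_j0 = 4 * math.pi * (g54 / g34) ** 2

# Closed form assumed for integral_0^inf J_{5/4}(x)^2 / sqrt(x) dx
bessel_integral = (math.gamma(0.5) * math.gamma(1.5)
                   / (math.sqrt(2) * g34 ** 2 * math.gamma(2.0)))

# Fisher information divided by j^2, then N * L_F with N = 2j and L_F = F^(-1/2)
fisher_over_j2 = 8 * g34 ** 2 / (math.pi * math.sqrt(2)) * bessel_integral
n_l_fisher_j0 = 2 / math.sqrt(fisher_over_j2)

# Reciprocal-of-peak constant for the optimum state: N * L_rp = pi^3 / 4
n_l_rp_opt = math.pi ** 3 / 4

print(n_l_rp_j0, n_l_fisher_j0, n_l_rp_opt)
```

The Gamma functions cancel in the Fisher information, which is why that constant comes out as a simple number.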

The Sussman measure and entropic length can also be determined by numerical integrals. The scaling constants found, along with the scaling constants found above for the other measures, are given in Table 5.2. Note that there is very good agreement with the results based on the exactly calculated data in these two cases as well. The numerical integral for the entropic length is particularly difficult to calculate, as the probability distribution falls very slowly with the phase, and the integral must be calculated to very large values of $j\phi$. For this reason only 7 significant digits could be found.

It is also possible to determine the scaling constants for the state optimised for minimum intrinsic phase variance using an asymptotic approximation. It is easy to see that in the limit of large $N$ and small $\phi$, Eq. (5.68) becomes

$$
\begin{aligned}
P(\phi) &\approx \frac{1}{2\pi}\,\frac{1}{N+2}\,\frac{\left(\frac{\pi}{N+2}\right)^2\left(1 + \cos[\phi(N+2)]\right)}{\left[\frac{1}{2}\phi^2 - \frac{1}{2}\left(\frac{\pi}{N+2}\right)^2\right]^2} \\
&\approx 2\pi N\,\frac{1 + \cos N\phi}{\left[(N\phi)^2 - \pi^2\right]^2}.
\end{aligned} \tag{5.92}
$$

In the second line it has been assumed that $N + 2 \approx N$ in the limit of large photon number.

Note that the asymptotic approximations for both of these states have the general form

$$
P(\phi) = N f(N\phi). \tag{5.93}
$$

For asymptotic functions of this form, we obtain $1/N$ scaling for each of the measures of phase uncertainty, provided that the integrals are bounded. To see this, note that each of the measures is, in terms of the asymptotic function,

$$
\begin{aligned}
(N\Delta\phi)^2 &= \int_{-N\pi}^{N\pi} f(x)(x - \bar{x})^2\, dx \\
\log N L_H &= -\int_{-N\pi}^{N\pi} f(x)\log f(x)\, dx \\
(N L_F)^{-2} &= \int_{-N\pi}^{N\pi}\frac{1}{f(x)}\left[\frac{df(x)}{dx}\right]^2 dx \\
1 - C &= \int_{\bar{x} - N L_C}^{\bar{x} + N L_C} f(x)\, dx \\
(N L_{\rm rp})^{-1} &= \max f(x) \\
(N L_S)^{-1} &= \int_{-N\pi}^{N\pi} [f(x)]^2\, dx.
\end{aligned} \tag{5.94}
$$

In the asymptotic limit the Holevo variance is given by the same expression as the standard variance. For the standard or Holevo variance in the case of the $|j0\rangle_z$ state, the integral is not bounded, and must be considered up to the limit $\phi = \pi/2$. That is why $1/N$ scaling is not obtained in that case.

For the asymptotic expression for the optimal input states, the scaling constant for the variance
is already known to be $\pi$, using the analytic result for the Holevo phase variance. The scaling
constant for the reciprocal-peak measure of the phase uncertainty is easily determined analytically
as $\pi^3/4$. For the other measures, the scaling constants can be determined using numerical integrals,
and the results are as shown in Table 5.2. Comparing the results in Tables 5.1 and 5.2, it can be
seen that there is excellent agreement between the scaling constants for all of the measures.
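
The two analytic constants quoted above can be checked with a short numerical sketch. It assumes
that the asymptotic form of Eq. (5.92) corresponds to the profile
$f(x) = 2\pi(1+\cos x)/(x^2-\pi^2)^2$ with $x = N\phi$; the integration range, grid spacing and all
variable names are illustrative choices, not taken from the original calculations.

```python
import numpy as np

def f(x):
    """Assumed asymptotic profile f(x) = 2*pi*(1 + cos x) / (x^2 - pi^2)^2.

    Written with 1 + cos x = 2 cos^2(x/2) to reduce cancellation near x = +-pi,
    where the function has a removable singularity with limit 1/(4*pi).
    """
    x = np.asarray(x, dtype=float)
    d = x**2 - np.pi**2
    safe = np.where(d == 0.0, 1.0, d)           # avoid division by zero
    val = 4.0 * np.pi * np.cos(0.5 * x)**2 / safe**2
    return np.where(d == 0.0, 1.0 / (4.0 * np.pi), val)

# Simple Riemann sum on a wide, fine grid (illustrative cutoff L).
L, n = 1.0e4, 1_000_001
x = np.linspace(-L, L, n)
dx = x[1] - x[0]
fx = f(x)

norm = np.sum(fx) * dx                     # should be close to 1
spread = np.sqrt(np.sum(x**2 * fx) * dx)   # N * (standard deviation); mean is 0 by symmetry
recip_peak = 1.0 / float(f(0.0))           # N * (reciprocal-of-peak measure)

print(norm, spread, np.pi)
print(recip_peak, np.pi**3 / 4)
```

The second moment converges only as $1/L$ because the tails of $f(x)$ fall off as $x^{-4}$, so the
cutoff must be large before the value $\pi$ is approached; the same slow decay is what makes the
entropic-length integral demanding.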

We therefore find that the asymptotic expressions accurately give the scaling constants for all
the cases considered, except the Holevo or standard variance for the $|j0\rangle_z$ state. For these cases
the main contribution is from significant values of $\phi$, where the asymptotic expression is not
accurate. The scaling constants are higher for $|j0\rangle_z$ than the optimal state for all of the measures
except the reciprocal-of-peak and Fisher length. As explained above, these measures appear to give
unrealistically low estimates of the phase uncertainty for the $|j0\rangle_z$ state.

Similarly the Holevo and standard variances give unrealistically high estimates of the phase
uncertainty for $|j0\rangle_z$. A more accurate way of comparing the two states is through confidence
intervals. For the 2/3 confidence interval, the scaling constant is only slightly higher for $|j0\rangle_z$
than the optimal state. The difference is far more pronounced if we consider a higher probability
confidence interval. For a 90% confidence interval, the scaling constant is only a little higher for the
optimal state, at about 4.88, but it is 37.69 for $|j0\rangle_z$.

Probably the most accurate way of comparing the two distributions is through the entropic
length, as this corresponds exactly with the phase information contained in the distribution. The
scaling constant for the entropic length is almost three times higher for the $|j0\rangle_z$ state than for the
optimal state. These results demonstrate that although the phase uncertainty scales as $N^{-1}$ for
$|j0\rangle_z$, it is not as good as the optimal state.



Chapter 6 

Optimum Adaptive Interferometry 



Now that the input states have been fully considered, the next factor to consider is the optimum 
way of measuring these states. There are basically two components to this: the feedback phase that 
is used during the measurement, and the phase estimate that is used at the end of the measurement. 
It has been shown |3^, pi], Q that in the single mode case it is possible to make very good phase 
measurements by using feedback to an auxiliary phase shift. In order to apply the same principle 
here, I consider adaptive measurements where the phase to be measured, $\varphi$, is in one arm of the
interferometer, and a known phase shift, $\Phi$, is introduced into the other arm of the interferometer,
as in Fig. 5.1.



The initial feedback scheme that will be considered is that where the introduced phase shift is 
adjusted in order to minimise the variance in the phase estimate after the next photodetection. In 
order to evaluate this feedback scheme, the optimal phase estimates are required; these are derived
in Sec. 6.2. It is also possible to select the feedback phases in order to minimise the final variance,
using numerical minimisation techniques. 

For these adaptive schemes to work, the feedback that adjusts $\Phi$ must act much faster than the
average time between photon arrivals. For simplicity I make the assumption that the feedback is
arbitrarily fast, which simply means that the phase $\Phi$ is always assumed to have been changed before
the next detection occurs. It is the ability to change $\Phi$ during the passage of a single (two-mode)
pulse that makes photon counting measurements more general than a measurement of the output 
J z considered in [|§ |o). 

As mentioned in the previous chapter, it is assumed that the photodetectors have unit efficiency. 
In addition, only single detection events are considered, rather than multiple detections. Physically, 
simultaneous detections correspond to individual detections that are too close together to be re- 
solved by the apparatus. Here it will be assumed that the apparatus is arbitrarily fast, so that all 
the detections can be resolved, and the feedback phase updated between detections. This is not 
necessarily realistic for current technology, particularly for larger photon numbers. However, the 
purpose of this study is to examine how far phase measurements can, in principle, be improved, 
rather than to examine the limits of current technology. 



6.1 Preliminary Theory 

To describe the measurement, it is convenient to denote the result $u$ from the $m$th detection as $u_m$
(which is 0 or 1 depending on which output the photon is detected in), and the measurement record
up to and including the $m$th detection as the binary string $n_m = u_m \ldots u_2 u_1$. The input state after
$m$ detections will be a function of the measurement record and $\varphi$, and is denoted as $|\psi(n_m, \varphi)\rangle$. In
the calculations these states are not normalised (except for the initial state), in order to express the
state as a power series in $e^{i\varphi}$.

The time between detections does not give any information, and is not included in the
measurement record. To see this, note that the measurement operator for no detection is

$$\Omega_0 = 1 - \tfrac{1}{2}(a^\dagger a + b^\dagger b)\,dt, \qquad (6.1)$$

where $a$ and $b$ are the annihilation operators for the two input modes. As this does not depend on
the phase, the probability distribution for the phase is unchanged between detections. In addition,
we consider only states with a fixed total photon number $N$. This means that, after $m$ detections,
the system will be in an eigenstate of $a^\dagger a + b^\dagger b$ with eigenvalue $N - m$. Therefore the measurement
operator $\Omega_0$ does not produce any system evolution. As neither the probability distribution for the
phase nor the system state changes between detections, the time between detections may be ignored.

Now the evolution produced by detections will be considered. After the first beam splitter the
operators for the two beams are (in the Heisenberg picture)

$$(a + ib)/\sqrt{2}, \qquad (ia + b)/\sqrt{2}. \qquad (6.2)$$

The two beams are then subjected to phase shifts of $\varphi$ and $\Phi$, so the operators become

$$e^{i\varphi}(a + ib)/\sqrt{2}, \qquad e^{i\Phi}(ia + b)/\sqrt{2}. \qquad (6.3)$$

Lastly, the effect of the second beam splitter is the same as the first, giving the two operators

$$\begin{aligned}
& i e^{i(\varphi+\Phi)/2}(a\sin\theta + b\cos\theta), \\
& i e^{i(\varphi+\Phi)/2}(a\cos\theta - b\sin\theta),
\end{aligned} \qquad (6.4)$$

where $\theta = (\varphi - \Phi)/2$. Ignoring the unimportant initial phase factors, these can be represented as
the operators $c_0, c_1$, where

$$c_u = a\sin(\theta + u\pi/2) + b\cos(\theta + u\pi/2). \qquad (6.5)$$
The input state is then determined by the initial condition

$$|\psi(n_0, \varphi)\rangle = |\psi\rangle, \qquad (6.6)$$

where $n_0$ is the empty string, and the recurrence relation

$$|\psi(u_m n_{m-1}, \varphi)\rangle = \frac{c_{u_m}(\varphi)}{\sqrt{2j+1-m}}\,|\psi(n_{m-1}, \varphi)\rangle. \qquad (6.7)$$

These states are not normalised, and their norm represents the probability for the measurement
record $n_m$ given $\varphi$. That is,

$$P(n_m|\varphi) = \langle\psi(n_m, \varphi)|\psi(n_m, \varphi)\rangle. \qquad (6.8)$$
An arbitrary input state with $N = 2j$ photons can be expressed as a sum of $J_z$ eigenstates

$$|\psi\rangle = \sum_{\mu=-j}^{j} \psi_\mu |j\mu\rangle_z. \qquad (6.9)$$

The state after $m$ detections will be (for $m \ge 1$) a function of $\varphi$. It will be denoted as follows:

$$|\psi(n_m, \varphi)\rangle = \sum_{\mu=-j+m/2}^{j-m/2} \psi_{\mu,m}(n_m, \varphi)\,|j - m/2, \mu\rangle_z. \qquad (6.10)$$



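Using the recurrence relation (6.7), it can be shown that the functional form of $\psi_{\mu,m}(n_m, \varphi)$ is always

$$\psi_{\mu,m}(n_m, \varphi) = \sum_{k=-m/2}^{m/2} \psi_{\mu,m,k}(n_m)\,e^{ik\varphi}. \qquad (6.11)$$

The recurrence relation for the coefficients $\psi_{\mu,m,k}$ is

$$\begin{aligned}
\psi_{\mu,m,k}(n_m) ={}& \frac{e^{-i(\Phi_m - u_m\pi)/2}}{2\sqrt{2j-m+1}}
\left[s_-\,\psi_{\mu-\frac12,m-1,k-\frac12}(n_{m-1}) - i s_+\,\psi_{\mu+\frac12,m-1,k-\frac12}(n_{m-1})\right] \\
&+ \frac{e^{i(\Phi_m - u_m\pi)/2}}{2\sqrt{2j-m+1}}
\left[s_-\,\psi_{\mu-\frac12,m-1,k+\frac12}(n_{m-1}) + i s_+\,\psi_{\mu+\frac12,m-1,k+\frac12}(n_{m-1})\right],
\end{aligned} \qquad (6.12)$$

where

$$s_\pm = \sqrt{j \pm \mu - \frac{m}{2} + 1}. \qquad (6.13)$$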



We need to express the system state in this form for two main reasons. Firstly the feedback 
scheme should not depend on the actual value of the phase, only on the measurement record. To 
determine the feedback phase, the probability distribution for the system phase is required. This 
can be determined from the variation of the state with the unknown system phase. This requires 
the above coefficients, as repeating the calculation for each individual value of the phase would be 
too inefficient. Secondly, we can perform the entire calculation independently of the system phase, 
and average over the system phase at the end of the measurement. This allows us to take account 
exactly of the full range of input phases. 

The probability distribution for the unknown phase can be determined using Bayes' theorem

$$P(\varphi|n_m) = \frac{P(\varphi)P(n_m|\varphi)}{P(n_m)}. \qquad (6.14)$$

The probability distribution for the phase at the start of the measurement, $P(\varphi)$, will be flat, as it
is assumed that there is no prior knowledge about the phase. The divisor $P(n_m)$ is independent of
the phase, and therefore only provides a normalising factor to the phase distribution. Ignoring these
terms gives

$$P(\varphi|n_m) \propto P(n_m|\varphi), \qquad (6.15)$$

and therefore

$$P(\varphi|n_m) \propto \langle\psi(n_m, \varphi)|\psi(n_m, \varphi)\rangle. \qquad (6.16)$$



Note that this result is equivalent to Eq. ( 1.72 ) for the case of continuous measurements. A similar 
Bayesian approach to interferometry has been considered before, and used in analysing experimental 
results H^]. However, this was done only with non-adaptive measurements and with all particles 
incident on one port. 
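
As a small worked illustration of Eqs. (6.5), (6.7) and (6.16), consider the simplest case of a single
photon ($N = 1$) that enters in mode $a$, with mode $b$ empty. This particular input state is chosen
here only as an example. The normalising factor in (6.7) is then unity, the $b$ term of $c_{u_1}$ gives zero,
and only the vacuum remains after the detection:

```latex
\begin{aligned}
|\psi(u_1,\varphi)\rangle &= \sin\!\left(\frac{\varphi-\Phi_1+u_1\pi}{2}\right)|\mathrm{vac}\rangle, \\
P(u_1|\varphi) &= \sin^2\!\left(\frac{\varphi-\Phi_1+u_1\pi}{2}\right)
               = \tfrac{1}{2}\left[1-(-1)^{u_1}\cos(\varphi-\Phi_1)\right], \\
P(\varphi|u_1) &= \frac{1}{2\pi}\left[1-(-1)^{u_1}\cos(\varphi-\Phi_1)\right].
\end{aligned}
```

The two probabilities sum to one for every $\varphi$, and the posterior is normalised over $[-\pi,\pi]$.
A detection in output $u_1 = 0$ therefore makes phases near $\Phi_1 + \pi$ most likely, and a detection in
output $u_1 = 1$ makes phases near $\Phi_1$ most likely.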



6.2 Optimum Phase Estimates 

Before describing the phase feedback technique, I will firstly describe how to select the final phase 
estimate. This is necessary because it is not possible to determine the phase variance produced 
by a phase feedback technique without specific final phase estimates. The best estimate to use is 
that which minimises the Holevo variance in the final phase estimate. This can be determined by 
summing over the $2^{2j}$ combinations of results, then averaging over the input phase.

For greater generality, the optimum phase estimate after m detections will be considered, rather 
than specifically the phase estimate at the end of the measurement. The result for the final phase 
estimate can then be obtained by substituting m = 2j. First, summing over the combinations of 
results gives the probability distribution for the error in the phase estimate as

$$P(\phi|\varphi) = \sum_{[n_m]=0}^{2^m-1} P(n_m|\varphi)\,\delta\big(\phi - (\hat\varphi(n_m) - \varphi)\big), \qquad (6.17)$$

where the square brackets in $[n_m]$ denote the numerical value of this binary string interpreted as a
binary number, and $\hat\varphi(n_m)$ is the final phase estimate.

Next we wish to average over the system phase and the initial feedback phase. For a feedback
scheme to be unbiased (in that it treats all input phases equivalently), the initial feedback phase
should be chosen at random, as there is no information to base this phase on. This phase should
therefore be averaged over in order to determine the average probability distribution. In determining
the coefficients $\psi_{\mu,m,k}(n_m)$, a specific initial feedback phase must be chosen, so at first it might appear
that this phase cannot be averaged over. Note, however, that all the successive feedback phases are
relative to the initial feedback phase. If the initial feedback phase is altered by some amount $\Delta\Phi$,
then all the successive feedback phases will be altered by the same amount (for the same detection
results). This is because, if the feedback scheme is unbiased, the initial feedback phase provides a
reference phase.

Therefore it is only the difference between the system phase and the initial feedback phase that
is significant, and we need only average over one of them. I will take the initial feedback phase to be
zero, which is equivalent to measuring all phases relative to the initial feedback phase, and average
over the system phase. Performing this average gives

$$\begin{aligned}
P(\phi) &= \int_{-\pi}^{\pi} \frac{d\varphi}{2\pi} \sum_{[n_m]=0}^{2^m-1} P(n_m|\varphi)\,\delta\big(\phi - (\hat\varphi(n_m) - \varphi)\big) \\
&= \frac{1}{2\pi} \sum_{[n_m]=0}^{2^m-1} P(n_m|\hat\varphi(n_m) - \phi).
\end{aligned} \qquad (6.18)$$

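The exact phase variance for the measurement scheme can be determined from this probability
distribution. Evaluating $\langle e^{i\phi}\rangle$ gives

$$\begin{aligned}
\langle e^{i\phi}\rangle &= \int_{-\pi}^{\pi} d\phi\, e^{i\phi}\, \frac{1}{2\pi} \sum_{[n_m]=0}^{2^m-1} P(n_m|\hat\varphi(n_m) - \phi) \\
&= \frac{1}{2\pi} \sum_{[n_m]=0}^{2^m-1} e^{i\hat\varphi(n_m)} \int_{-\pi}^{\pi} d\phi\, e^{-i(\hat\varphi(n_m)-\phi)}\, P(n_m|\hat\varphi(n_m) - \phi) \\
&= \frac{1}{2\pi} \sum_{[n_m]=0}^{2^m-1} e^{i\hat\varphi(n_m)} \int_{-\pi}^{\pi} dx\, e^{-ix}\, P(n_m|x).
\end{aligned} \qquad (6.19)$$

In order to minimise the Holevo phase variance, we wish to maximise $|\langle e^{i\phi}\rangle|$. A phase estimate that
maximises this is

$$\begin{aligned}
\hat\varphi(n_m) &= \arg \int_{-\pi}^{\pi} e^{i\varphi} P(n_m|\varphi)\,d\varphi \\
&= \arg \int_{-\pi}^{\pi} e^{i\varphi} \langle\psi(n_m, \varphi)|\psi(n_m, \varphi)\rangle\,d\varphi \\
&= \arg \int_{-\pi}^{\pi} e^{i\varphi} P(\varphi|n_m)\,d\varphi \\
&= \arg\langle e^{i\varphi}\rangle,
\end{aligned} \qquad (6.20)$$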

where the expectation value is determined from the probability distribution for the phase based on
the measurement record. For the specific case of the phase estimate at the end of the measurement,
this can be calculated as

$$\hat\varphi(n_{2j}) = \arg \sum_{k=-j}^{j-1} \psi_{0,2j,k}\,\psi^*_{0,2j,k+1}. \qquad (6.21)$$

Unfortunately there is a slight ambiguity here, as the same Holevo phase variance will be obtained
if a constant is added to these phase estimates. This would make the probability distribution biased,
and as discussed in the introduction, a better way of evaluating the variance when the measurements
may be biased is

$$V(\phi) = \left[\mathrm{Re}\langle e^{i\phi}\rangle\right]^{-2} - 1. \qquad (6.22)$$

Note that $e^{i\phi}$ is used here, rather than $e^{i(\phi-\varphi)}$, because $\phi$ is the deviation from the system phase.
Minimising this estimate of the phase variance is equivalent to maximising $\mathrm{Re}\langle e^{i\phi}\rangle$. If any constant
is added to the phase estimates given by Eq. (6.20), the value of $|\langle e^{i\phi}\rangle|$ will be the same, but
the value of $\mathrm{Re}\langle e^{i\phi}\rangle$ will be smaller, as $\langle e^{i\phi}\rangle$ is complex. Therefore the phase estimates given by
Eq. (6.20) are the unique solution that maximises $\mathrm{Re}\langle e^{i\phi}\rangle$.



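With these phase estimates, the exact variance after $m$ detections can be determined using the
simple expression

$$|\langle e^{i\phi}\rangle| = \frac{1}{2\pi} \sum_{[n_m]=0}^{2^m-1} \left| \int_{-\pi}^{\pi} e^{i\varphi} \langle\psi(n_m, \varphi)|\psi(n_m, \varphi)\rangle\,d\varphi \right|. \qquad (6.23)$$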



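As these phase estimates are unbiased, the absolute value will be used, rather than the real part.
The final variance can be determined using

$$|\langle e^{i\phi}\rangle| = \sum_{[n_{2j}]=0}^{2^{2j}-1} \left| \sum_{k=-j}^{j-1} \psi_{0,2j,k}(n_{2j})\,\psi^*_{0,2j,k+1}(n_{2j}) \right|. \qquad (6.24)$$

This means that during calculations, after each sequence of measurements, the phase estimate and
contribution to $|\langle e^{i\phi}\rangle|$ can be determined as the phase and magnitude respectively of

$$\sum_{k=-j}^{j-1} \psi_{0,2j,k}(n_{2j})\,\psi^*_{0,2j,k+1}(n_{2j}). \qquad (6.25)$$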



6.3 The Feedback Technique 

In order to find the optimum phase feedback technique, we need to find the feedback phase $\Phi_m$ for
each measurement record $n_{m-1}$ that maximises the value of $|\langle e^{i\phi}\rangle|$ as determined using Eq. (6.24)
(and therefore minimises the Holevo phase variance). This is, in general, a very difficult problem,
and I will initially consider a much simpler feedback technique that can be determined analytically.

Rather than choosing each feedback phase to minimise the final phase variance at the end of 
the measurement, we can choose the feedback phase that minimises the phase variance after the 
next detection. This will mean that the last feedback phases are optimal, but not necessarily the 
intermediate feedback phases. This is because they minimise the intermediate phase variances, not 
the final phase variance. 

In order to see how to choose the phase estimate, recall that the value of $|\langle e^{i\phi}\rangle|$ after the $m$th
detection is given by Eq. (6.23). For the feedback phase before the $m$th detection, $\Phi_m$, we need only
consider the part of the sum for the measurement record $n_{m-1}$, as this is the only part of the sum
that is affected by the feedback phase. We must still sum over the $m$th detection result, as this is
still unknown. The expression to be maximised is therefore

$$\sum_{u_m=0,1} \left| \int_{-\pi}^{\pi} e^{i\varphi} \langle\psi(u_m n_{m-1}, \varphi)|\psi(u_m n_{m-1}, \varphi)\rangle\,d\varphi \right|. \qquad (6.26)$$



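In order to use this expression we require $\langle\psi(u_m n_{m-1}, \varphi)|\psi(u_m n_{m-1}, \varphi)\rangle$ explicitly in terms of
$u_m$. It is straightforward to show from Eq. (6.12) that

$$\begin{aligned}
\langle\psi(u_m n_{m-1}, \varphi)|\psi(u_m n_{m-1}, \varphi)\rangle = \frac{1}{2}\Big[&\langle\psi(n_{m-1}, \varphi)|\psi(n_{m-1}, \varphi)\rangle
+ \lambda_{m-1}(\varphi)\,e^{i\varphi}e^{-i(\Phi_m - u_m\pi)} \\
&+ \lambda^*_{m-1}(\varphi)\,e^{-i\varphi}e^{i(\Phi_m - u_m\pi)}\Big],
\end{aligned} \qquad (6.27)$$

where $\lambda_{m-1}(\varphi)$ is defined by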



where 



m — 1 



n— — m+l 
£m — l.n ^Cm— 1, 



A m -i . 



2j — m + 1 



and where 



k-k> 



(6.28) 



(6.29) 



(6.30)
(6.31)



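Using this result it is possible to show that

$$\begin{aligned}
M(\Phi_m) = \sum_{u_m=0,1} \bigg| \int_{-\pi}^{\pi} e^{i\varphi}\, \frac{1}{2}\Big[&\langle\psi(n_{m-1}, \varphi)|\psi(n_{m-1}, \varphi)\rangle
+ \lambda_{m-1}(\varphi)\,e^{i\varphi}e^{-i(\Phi_m - u_m\pi)} \\
&+ \lambda^*_{m-1}(\varphi)\,e^{-i\varphi}e^{i(\Phi_m - u_m\pi)}\Big]\,d\varphi \bigg|.
\end{aligned} \qquad (6.32)$$

This can be expressed in the form

$$M(\Phi_m) = \left|a + b e^{-i\Phi_m} + c e^{i\Phi_m}\right| + \left|a - b e^{-i\Phi_m} - c e^{i\Phi_m}\right|, \qquad (6.33)$$

where

$$a = \pi \sum_{\mu=-j+(m-1)/2}^{j-(m-1)/2}\ \sum_{k=-(m-1)/2}^{(m-3)/2} \psi_{\mu,m-1,k}\,\psi^*_{\mu,m-1,k+1}, \qquad (6.34)$$

$$b = \pi\Lambda_{m-1,-2}, \qquad (6.35)$$

$$c = \pi\Lambda^*_{m-1,0}. \qquad (6.36)$$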


There is an analytic solution for the $\Phi_m$ that maximises $M(\Phi_m)$. This solution gives three phases,
$\Phi_0$ and $\Phi_\pm$, and the phase that is optimal must be found by substituting into Eq. (6.33). These
phases are given by

$$\Phi_0 = \arg(ba^* - c^*a), \qquad (6.37)$$

and

$$\Phi_\pm = \arg\sqrt{\frac{c_2 \pm \sqrt{c_2^2 + |c_1|^2}}{c_1}}, \qquad (6.38)$$

where

$$c_1 = (a^*c)^2 - (ab^*)^2 + 4(|b|^2 - |c|^2)\,b^*c, \qquad (6.39)$$

$$c_2 = -2i\,\mathrm{Im}(a^2 b^* c^*). \qquad (6.40)$$

Note that $M(\Phi) = M(\Phi + \pi)$ so that in addition to the solution found by this method there will be
another differing by $\pi$. It does not matter which of these is chosen; it simply reverses the significance
of the two alternative detection results $u_m = 0$ and 1.

As mentioned above, the initial feedback phase $\Phi_1$ should be chosen at random, as there is no
information to base it on. At each following step, we determine the optimal feedback phase by the
method described above, then determine the evolution of the state for that feedback phase. This
process continues until all photons have been counted. The measurement record is then the binary
string $n_{2j} = u_{2j} \ldots u_2 u_1$ and the result is a posterior distribution $P(\varphi|n_{2j})$ that is proportional to
$\langle\psi(n_{2j}, \varphi)|\psi(n_{2j}, \varphi)\rangle$ and is characterised by the $2j + 1$ numbers $\psi_{0,2j,k}(n_{2j})$.
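
The procedure of this section can be sketched numerically for very small photon numbers. The code
below is only an illustration under stated assumptions: it stores the coefficients $\psi_{\mu,m,k}$ in a
two-dimensional array, updates them with a recurrence of the form of Eq. (6.12), and, instead of the
analytic solution above, picks each feedback phase by a brute-force grid search that maximises the
sum over $u_m$ of the magnitude in Eq. (6.26). The function names and the grid resolution are
hypothetical choices made for this example.

```python
import numpy as np

def detect(psi, u, Phi):
    """One detection with result u (0 or 1) and feedback phase Phi.

    psi[i, l] holds psi_{mu,m-1,k}; rows index mu (increasing), columns index k.
    Returns the coefficients after the m-th detection (one row fewer, one column more).
    """
    rows, cols = psi.shape                      # rows = 2j-m+2, cols = m
    i = np.arange(rows - 1)
    s_plus = np.sqrt(i + 1.0)[:, None]          # sqrt(j + mu - m/2 + 1)
    s_minus = np.sqrt(rows - 1.0 - i)[:, None]  # sqrt(j - mu - m/2 + 1)
    lower, upper = psi[:-1, :], psi[1:, :]      # entries at mu - 1/2 and mu + 1/2
    phase = np.exp(-0.5j * (Phi - u * np.pi))
    norm = 2.0 * np.sqrt(rows - 1.0)            # 2 * sqrt(2j - m + 1)
    new = np.zeros((rows - 1, cols + 1), dtype=complex)
    new[:, 1:] += phase * (s_minus * lower - 1j * s_plus * upper) / norm            # from k - 1/2
    new[:, :-1] += np.conj(phase) * (s_minus * lower + 1j * s_plus * upper) / norm  # from k + 1/2
    return new

def overlap(psi):
    """sum over mu and k of psi_{mu,m,k} * conj(psi_{mu,m,k+1})."""
    return np.sum(psi[:, :-1] * np.conj(psi[:, 1:]))

# Grid over [0, pi): the quantity being maximised is periodic in Phi with period pi.
GRID = np.linspace(0.0, np.pi, 180, endpoint=False)

def greedy_phase(psi):
    """Grid-search stand-in for the analytic maximisation of M(Phi_m)."""
    scores = [sum(abs(overlap(detect(psi, u, Phi))) for u in (0, 1)) for Phi in GRID]
    return GRID[int(np.argmax(scores))]

def exact_sharpness(psi, choose_phase=greedy_phase):
    """|<exp(i phi)>| summed over all measurement records, as in Eq. (6.24)."""
    if psi.shape[0] == 1:                       # all photons detected
        return abs(overlap(psi))
    Phi = choose_phase(psi)
    return sum(exact_sharpness(detect(psi, u, Phi), choose_phase) for u in (0, 1))

def all_in_one_port(N):
    """Input state with all N photons in mode a (largest mu)."""
    psi = np.zeros((N + 1, 1), dtype=complex)
    psi[-1, 0] = 1.0
    return psi

for N in (1, 2, 3):
    S = exact_sharpness(all_in_one_port(N))
    print(N, S, 1.0 / S**2 - 1.0)               # photon number, sharpness, Holevo variance
```

Because every measurement record is enumerated, the cost grows as $2^{2j}$, so this approach is only
practical for small photon numbers. For one and two photons entering a single port the sketch gives
sharpness values of $1/2$ and $1/\sqrt{2}$ respectively.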



6.4 Stochastic Method 



For a moderate number of photons it is possible to determine the exact phase variance by
systematically determining the evolution of the state for each combination of measurement results and using
Eq. (6.24). As the number of possible measurement records increases as $2^{2j}$, the exact variance can
be determined only for moderate photon numbers (up to 20 or 30). For larger photon numbers it is
necessary to determine the phase distribution stochastically.

The initial feedback phase is selected at random, and the results of the detections are selected 
by their probability of occurring. In order to determine the probabilities, a specific system phase 
must be selected. For simplicity, the system phase was taken to be zero in the results that will be 
presented here. This leads to no loss of generality, as the initial feedback phases were selected at 
random. 

In order to determine the probabilities, it is simplest to determine the phase dependent state
coefficients $\psi_{\mu,m}(n_m, \varphi)$. The state coefficients will change as

$$\psi_{\mu,m}(n_m, \varphi) = s_+ \sin\theta_m\, \psi_{\mu+\frac12,m-1}(n_{m-1}, \varphi) + s_- \cos\theta_m\, \psi_{\mu-\frac12,m-1}(n_{m-1}, \varphi), \qquad (6.41)$$

where

$$\theta_m = \frac{\varphi - \Phi_m + u_m\pi}{2}. \qquad (6.42)$$

The probability of obtaining the result u m is given by 



1 e 

P(u m \(p,n m -i) = - + — 



COs(^-$ m ) ^ A 1 \i>iM,m-l(nm-U<P)\* 
^=-3 + ^ 



2 2j - m + 1 



(6.43) 



Here it has been assumed that the coefficients $\psi_{\mu,m-1}(n_{m-1}, \varphi)$ are normalised. The recurrence
relation (6.41) does not give the normalised coefficients. The coefficients determined from Eq. (6.41)
should be normalised in a separate calculation.

The detection results were chosen with probabilities determined using $\varphi = 0$ and these formulae,
and the final phase estimates were determined using Eq. (6.21). For the ensemble $\{\hat\varphi_\mu\}_{\mu=1}^{M}$ of $M$
final phase estimates the Holevo phase variance was estimated by

$$\left[\mathrm{Re}\left(M^{-1} \sum_{\mu=1}^{M} e^{i\hat\varphi_\mu}\right)\right]^{-2} - 1. \qquad (6.44)$$
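
The estimator of Eq. (6.44) is straightforward to apply to a list of simulated phase estimates. The
following sketch assumes the system phase is zero, as in the text, so that each estimate is also the
error; the function name and the Gaussian test data are illustrative and are not the data used for
the results in this chapter.

```python
import numpy as np

def holevo_variance(estimates):
    """Holevo phase variance estimated from an ensemble of phase estimates.

    Uses the real part of the mean of exp(i * estimate), so a bias in the
    estimates increases the returned variance.
    """
    z = np.mean(np.exp(1j * np.asarray(estimates, dtype=float)))
    return 1.0 / z.real**2 - 1.0

# Illustrative data: narrowly distributed errors around zero.
rng = np.random.default_rng(0)
samples = rng.normal(0.0, 0.05, size=2**10)
print(holevo_variance(samples), np.var(samples))
```

For a narrow distribution the result is close to the ordinary variance, while a small fraction of
estimates with large error raises it considerably.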



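6.5 Modulo $\pi$ States

In the case of the state with equal photon numbers in both ports, the above techniques will not
apply exactly, as the phase distribution repeats modulo $\pi$. This means, for example, that $\langle e^{i\phi}\rangle = 0$.
In the previous chapter the measure of the phase variance used for this state was

$$V_\pi(\phi) = \left(|\langle e^{2i\phi}\rangle|^{-2} - 1\right)/4. \qquad (6.45)$$

Most of the above analysis must also be modified to consider the phase modulo $\pi$.

Firstly the distribution (6.17) should be averaged over the interval $[-\pi/2, \pi/2]$ rather than
$[-\pi, \pi]$. This gives

$$P(\phi) = \frac{1}{\pi} \sum_{[n_m]=0}^{2^m-1} P(n_m|\hat\varphi(n_m) - \phi). \qquad (6.46)$$

Now, rather than evaluating $\langle e^{i\phi}\rangle$, we wish to find $\langle e^{2i\phi}\rangle$. This can be evaluated as

$$\langle e^{2i\phi}\rangle = \frac{1}{\pi} \sum_{[n_m]=0}^{2^m-1} e^{2i\hat\varphi(n_m)} \int_{-\pi/2}^{\pi/2} dx\, e^{-2ix}\, \langle\psi(n_m, x)|\psi(n_m, x)\rangle. \qquad (6.47)$$

This means that the optimal phase estimate is

$$\hat\varphi(n_m) = \frac{1}{2}\arg \int_{-\pi/2}^{\pi/2} e^{2i\varphi} \langle\psi(n_m, \varphi)|\psi(n_m, \varphi)\rangle\,d\varphi. \qquad (6.48)$$

This can be evaluated at the end of the measurement using

$$\hat\varphi(n_{2j}) = \frac{1}{2}\arg \sum_{k=-j}^{j-2} \psi_{0,2j,k}\,\psi^*_{0,2j,k+2}, \qquad (6.49)$$

and the final variance is calculated using

$$|\langle e^{2i\phi}\rangle| = \sum_{[n_{2j}]=0}^{2^{2j}-1} \left| \sum_{k=-j}^{j-2} \psi_{0,2j,k}(n_{2j})\,\psi^*_{0,2j,k+2}(n_{2j}) \right|. \qquad (6.50)$$

Now instead of Eq. (6.26), the expression to be maximised is

$$M(\Phi_m) = \sum_{u_m=0,1} \left| \int_{-\pi/2}^{\pi/2} e^{2i\varphi} \langle\psi(u_m n_{m-1}, \varphi)|\psi(u_m n_{m-1}, \varphi)\rangle\,d\varphi \right|. \qquad (6.51)$$

Using Eq. (6.27), this becomes

$$\begin{aligned}
M(\Phi_m) &= \frac{1}{2} \sum_{u_m=0,1} \bigg| \int_{-\pi/2}^{\pi/2} \Big[\langle\psi(n_{m-1}, \varphi)|\psi(n_{m-1}, \varphi)\rangle
+ \lambda_{m-1}(\varphi)\,e^{i\varphi}e^{-i(\Phi_m - u_m\pi)} \\
&\qquad\qquad + \lambda^*_{m-1}(\varphi)\,e^{-i\varphi}e^{i(\Phi_m - u_m\pi)}\Big]\,e^{2i\varphi}\,d\varphi \bigg| \\
&= \frac{\pi}{2} \sum_{u_m=0,1} \bigg| \sum_{\mu=-j+(m-1)/2}^{j-(m-1)/2}\ \sum_{k=-(m-1)/2}^{(m-5)/2} \psi_{\mu,m-1,k}\,\psi^*_{\mu,m-1,k+2} \\
&\qquad\qquad + \Lambda_{m-1,-3}\,e^{-i(\Phi_m - u_m\pi)} + \Lambda^*_{m-1,1}\,e^{i(\Phi_m - u_m\pi)} \bigg|.
\end{aligned} \qquad (6.52)$$

Thus $a$, $b$ and $c$ are given by

$$a = \frac{\pi}{2} \sum_{\mu=-j+(m-1)/2}^{j-(m-1)/2}\ \sum_{k=-(m-1)/2}^{(m-5)/2} \psi_{\mu,m-1,k}\,\psi^*_{\mu,m-1,k+2}, \qquad (6.53)$$

$$b = \frac{\pi}{2}\Lambda_{m-1,-3}, \qquad (6.54)$$

$$c = \frac{\pi}{2}\Lambda^*_{m-1,1}. \qquad (6.55)$$

The last equation that must be modified for this case is the formula for estimating the phase variance
from the data, which becomes

$$\frac{1}{4}\left\{\left[\mathrm{Re}\left(M^{-1} \sum_{\mu=1}^{M} e^{2i\hat\varphi_\mu}\right)\right]^{-2} - 1\right\}. \qquad (6.56)$$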

6.6 Results 

The results of using this adaptive phase measurement scheme on the four alternative input states
are shown in Fig. 6.1. The phase variances for states up to $N = 20$ (or $N = 30$ for $|jj\rangle_z$) were
determined exactly using Eq. (6.24), whereas those for larger photon numbers were determined
using the stochastic method described in Sec. 6.4. The sample sizes used were about $2^{15}$ for the
smallest photon numbers, down to $2^{10}$ for the larger photon numbers. The phase variances are very
close to the phase variances for canonical measurements for all of these input states.

For the optimal input states described in Ch. 5, the scaling is close to $1/N^2$, but the phase
variances do differ relatively more from the canonical values for larger photon numbers. If we plot
the phase variances as a ratio to the canonical phase variance (see Fig. 6.2), we find that the ratio
of the phase variance to the canonical phase variance increases fairly regularly with photon number.
This ratio possibly increases proportional to $\log N$. In that case the introduced phase variance would
increase as $\log N/N^2$, as is the case for the optimal single mode phase measurements considered in
Ch. |. 

For $|j0\rangle_z$ the variances are very close to those for canonical measurements, scaling as $1/N^{1/2}$. If
we look at the distribution of the phases resulting from these measurements, we find that there is a
sharp peak, but a significant number of results with large error that produce the large variance (see

Figure 6.1: Variances in the phase estimate versus input photon number $2j$. The lines are exact
results for canonical measurements on optimal states $|\psi_{\rm opt}\rangle$ (continuous line), on states with all
photons incident on one input port $|jj\rangle_z$ (dashed line), on states with equal photon numbers incident
on both input ports $|j0\rangle_z$ (dotted line), and the state $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$ (dash-dotted line). The
crosses are the numerical results for the adaptive phase measurement scheme on $|\psi_{\rm opt}\rangle$, the circles
are those on $|jj\rangle_z$, the pluses are those on $|j0\rangle_z$, and the asterisks are those on a $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$
input state. All variances for the $|j0\rangle_z$ state are for phase modulo $\pi$.




Figure 6.2: The ratio of the phase variance using the feedback scheme of Sec. 6.3 to the canonical
phase variance for optimal input states.

Figure 6.3: The phase distribution resulting from using adaptive phase measurements on an input
state of $|j0\rangle_z$ for 800 photons. The vertical axis has been cut off at 100 (the peak count is almost
500) to show the tails more clearly.



Fig. 6.3). Similarly, for the case of the state $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$, there are a large number of results
at $\pm\pi$, as seen in Fig. 6.4. As for the canonical distribution, this is why the phase variance does not
decrease significantly with photon number.

Now recall that although the phase uncertainty of the $|j0\rangle_z$ state as measured by the square root
of the variance does not scale as $N^{-1}$, other measures of the phase uncertainty do scale as $N^{-1}$.
This indicates that if the data is analysed in a way corresponding to one of these other measures,
the uncertainty should scale down more rapidly with photon number. I will consider 2/3 confidence
intervals, as the other measures do not make sense for discrete data.

The phase uncertainty for adaptive measurements on the $|j0\rangle_z$ state as measured by the 2/3
confidence interval is plotted in Fig. 6.5. A power law of the form $cN^{-p}$ was fitted to this data set
(for photon numbers of 20 and over), and it was found that the best fit was for $c = 1.39 \pm 0.08$ and
$p = 0.69 \pm 0.01$. This means that the scaling is not as good as the $N^{-1}$ scaling for the canonical
phase distribution, but it is still far better than the $N^{-1/4}$ scaling indicated by the variance.
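
One common way to fit a power law of the form $cN^{-p}$ is a straight-line fit on logarithmic axes.
The sketch below shows that approach as an assumption, since the fitting method actually used is not
stated here. The data are synthetic and noise-free, generated from the quoted best-fit values purely
to demonstrate the procedure, and the function name is hypothetical.

```python
import numpy as np

def fit_power_law(N, y):
    """Least-squares fit of y = c * N**(-p) via a linear fit of log y against log N."""
    slope, intercept = np.polyfit(np.log(N), np.log(y), 1)
    return np.exp(intercept), -slope

# Synthetic, noise-free data (NOT the measured confidence intervals).
N = np.array([20, 50, 100, 200, 400, 800, 1600], dtype=float)
y = 1.39 * N**-0.69

c, p = fit_power_law(N, y)
print(c, p)
```

With real data the scatter of the points about the fitted line would determine the uncertainties in
$c$ and $p$.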

In contrast, the confidence interval for measurements on the optimal state (also shown in Fig. 6.5)
scales very close to $N^{-1}$. In fact for the range of photon number considered the confidence interval
for the measurements is never more than about 30% above the canonical confidence interval. This
means that although the $|j0\rangle_z$ state has 2/3 confidence intervals for the canonical distribution that
are close to those for the optimal state, the confidence intervals for the measurements are far worse.

As mentioned in the previous chapter, the uncertainty in the mean phase will scale as $N^{-1/4}$
rather than $N^{-1}$ for $|j0\rangle_z$. According to the Central Limit Theorem, the probability distribution
for the mean of a large number of measurements will be close to a normal distribution, even if the
probability distribution for each individual sample is far from a normal distribution. In addition, the
variance in the mean is the variance for an individual sample divided by the number of samples. As
the probability distribution for the mean is approximately Gaussian, this means that the uncertainty
for any other measure will have the same scaling as the square root of the variance, $N^{-1/4}$.

For example, for a photon number of 1600 and a sample size of $2^{10}$, the uncertainty in the mean
phase as indicated by the standard error (the standard deviation divided by the square root of the
number of samples) is $4.0 \times 10^{-3}$. The mean phase is also around this value, at $3.5 \times 10^{-3}$. On the

Figure 6.4: The phase distribution resulting from using adaptive phase measurements on an input
state of $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$ for 800 photons.

Figure 6.5: The phase uncertainty resulting from using adaptive phase measurements as measured
using 2/3 confidence intervals. The pluses are the numerical results for an input state of $|j0\rangle_z$ and the
crosses are those for optimal states. The continuous line is the confidence interval for the canonical
distribution for $|\psi_{\rm opt}\rangle$, and the dotted line is the function fitted to the $|j0\rangle_z$ data.

Figure 6.6: The Holevo phase variance for optimal input states under various measurement schemes.
The canonical phase variance is shown as the continuous line, the results for the adaptive
measurement scheme of Sec. 6.3 as crosses, the non-adaptive measurement scheme of Eq. (6.57) as circles,
and the feedback scheme of Eq. (6.58) as pluses.



other hand the median is much closer to zero, at about $5.9 \times 10^{-5}$.

In order to find the best phase estimate based on the data, we would need to multiply together
the probability distributions for the phase from each of the samples. Similarly to the case for
the individual samples, the optimal phase estimate would then be given by $\frac{1}{2}\arg\langle e^{2i\varphi}\rangle$, where
the average is based on that distribution. The problem with this method is that it is extremely
computationally intensive. For the example given, we would need to multiply together 1024 sums,
each with 3200 terms, resulting in a total of more than 3.2 million terms.

I have also considered phase measurements using two other measurement schemes. The first is a
non-adaptive phase measurement introduced in Ref. [54] and defined by

$$\Phi_m = \Phi_0 + \frac{m\pi}{N}. \qquad (6.57)$$

This is analogous to heterodyne detection [34] on a single mode, in that the phase $\Phi$ equally weights
all relevant values over the course of the measurement. The second scheme is a simple adaptive
feedback scheme using a running estimate of the phase:

$$\begin{aligned}
\Phi_m &= \arg\langle e^{i\varphi}\rangle \\
&= \arg \sum_{\mu=-j+(m-1)/2}^{j-(m-1)/2}\ \sum_{k=-(m-1)/2}^{(m-3)/2} \psi_{\mu,m-1,k}\,\psi^*_{\mu,m-1,k+1}.
\end{aligned} \qquad (6.58)$$

This is motivated by the relative success of the analogous simple feedback scheme [34] for phase
measurement of a single mode. The difference here is that there is no $\pi/2$ term. For interferometry,
small phase uncertainties are obtained when the phase difference between the arms is close to zero 
p8| , p9| , poj , rather than tt/2. As mentioned in the previous chapter, the reason for this difference is 
that a different scattering matrix for the beam splitters has been assumed for interferometry than
for dyne measurements.

Figure 6.7: The exact phase variance for $|jj\rangle_z$ input states under two different measurement schemes
as a ratio to the canonical phase variance. The results for the adaptive measurement scheme of
Sec. 6.3 are shown as the continuous line, and the non-adaptive measurement scheme of Eq. (6.57)
as the dotted line.



The results of using these two measurement schemes, as well as the adaptive measurement scheme
of Sec. 6.3, on the optimal input states, are shown in Fig. 6.6. The non-adaptive measurement scheme
is far inferior to the adaptive measurement scheme of Sec. 6.3, and the variance scales as $N^{-1}$. The
simple adaptive feedback scheme also gives poor results. Although most of the phase results for this
feedback scheme have small error, there are a small number of results with very large error. This
also means that the results shown in Fig. 6.6 are fairly erratic, as the results for which a large error
sample was obtained have much larger phase variance.

I also considered the non-adaptive measurement scheme on the state with all photons in one
port. The exact results for that case for $N$ up to 30 are shown in Fig. 6.7. The phase variance is
not much more than the canonical phase variance, about 20% more for $N = 30$ and still decreasing.
This demonstrates that for this state there is relatively little improvement in using a more advanced
feedback scheme for larger photon numbers. The biggest improvement is a reduction in the variance
of about 24% for $N = 3$.



6.7 Optimal Feedback 

The next question is whether the adaptive measurement technique described above is optimal. Note 
firstly that the initial feedback phase has no effect, because it is effectively averaged over by averaging
over the system phase. Secondly the last feedback phase is always optimal, as was noted above. This
means that for states with 1 or 2 photons the measurement technique must be optimal. In fact, for 
the states considered here the phase variance was equal to the canonical phase variance for 1 or 2 
photons. 

To see if the phase variance was equal to canonical for arbitrary states, the complete range of
possible states was considered. For 1 photon the state can be expressed as

$$ |\psi\rangle = \sum_{\mu=-1/2}^{1/2} \psi_\mu \, |1/2, \mu\rangle. \tag{6.59} $$

Figure 6.8: The value of $|\langle e^{i\phi} \rangle|$ for a 1 photon input state as a function of $\psi_{1/2}$. The canonical
variance is shown as the continuous line and the value for the measurements is shown as the dotted
line (these lines overlap).



There are therefore two coefficients, $\psi_{\pm 1/2}$, which can in general take complex values. The magnitude
and phase of these coefficients give four real numbers that can be varied for these states. There is
the restriction, however, that the states must be normalised, which removes one degree of freedom.

Also the absolute phase of the coefficients is irrelevant. That is, changing the phase of both
coefficients $\psi_{\pm 1/2}$ by the same amount gives an equivalent state. In addition, the relative phases of
$\psi_{\pm 1/2}$ are irrelevant. This is because adding a phase difference between the coefficients simply gives
an equivalent input state with a phase shift. These considerations mean that only real $\psi_{\pm 1/2}$ need
be considered, and there is therefore only one degree of freedom.

The value of $\psi_{1/2}$ was varied in 10000 steps from 0 to 1 (see Fig. 6.8), and it was found that the
phase variance obtained was identical to the canonical phase variance for the entire range. Note that 
this case is independent of the feedback scheme, as there is only the initial feedback phase, which 
has no effect. This means that the variance is the same as the canonical variance, regardless of the 
input state or feedback scheme. 

For the case of 2 photons the state has three coefficients. Taking account of the magnitude and 
phase this gives 6 real numbers that can be varied. The same considerations as for the 1 photon 
case apply, leaving only 3 degrees of freedom. We can vary the magnitude of two of the coefficients 
independently, and the phase of one. 

The magnitudes of $\psi_{-1}$ and $\psi_{1}$ and the phase of $\psi_{0}$ were varied over the complete range with
100 steps in each of the variables, and it was found that the phase variance obtained was identical 
to the canonical phase variance. This demonstrates that the phase variance is as good as canonical 
for 2 photons, independent of the input state. This case is not independent of the feedback scheme, 
and if any other feedback phase is used the feedback is not quite as good as canonical. These results 
were also obtained when the states were selected completely at random. 

For the case of optimal input states with 3 or 4 photons, it was found that it is not possible to 
decrease the variance by altering the intermediate feedback slightly, so showing that the feedback 
technique is locally optimal for the phase variance. For more than 4 photons it is possible to reduce 
the phase variance by varying the intermediate feedback phases, and so the feedback is not optimal. 

In order to show that the feedback is globally optimal for 3 or 4 photons, it is necessary to test

Figure 6.9: The phase variance for the 3 photon optimum input state as a function of the second
feedback phase. The phase given here is relative to the second feedback phase given by the feedback
scheme of Sec. 6.3. The other feedback phases are as given by this feedback scheme.



the entire phase range. Only optimal input states will be considered here, and the more general 
case will be considered later. There are three factors that reduce the number of phases that need 
be varied. The first two are as noted above: the first feedback phase has no effect (and so may be 
ignored), and the last feedback phase is always optimal (and so need not be varied). The third is 
that the contribution to the phase variance for a sequence of detections is independent of the first 
detection result. This is because changing the first feedback phase by $\pi$ reverses the significance of
the first detection results, and the first feedback phase is arbitrary. 

The consequence of these three factors is that for 3 photons the variation of only one feedback 
phase needs to be considered, and for 4 photons the variation of three feedback phases needs to be 
considered. The phase variance for the 3 photon case as the second feedback phase is varied from its 
value for the feedback technique of Sec. 6.3 is shown in Fig. 6.9. This figure shows that this feedback
technique is globally optimal for 3 photons. Since the phase variance is above the canonical phase 
variance in this case, this demonstrates that it is not possible to perform canonical measurements 
using photodetection and feedback alone. For 4 photons the second feedback phase and two third 
feedback phases must be varied. This case was tested with 100 steps in each of the three variables, 
and it was found that the feedback technique is globally optimal in this case also. 

In order to see how far the phase variance could be improved for photon numbers above 4, the 
feedback phases were optimised using function minimisation techniques, and the results are shown 
in Fig. 6.10. Unfortunately the number of feedback phases increases exponentially with the photon 
number, making this technique infeasible very rapidly, and therefore only results up to $N = 12$ are
shown. As can be seen, this optimisation only gives minor improvements in the phase variance, with 
the maximum reduction in the phase variance being about 3.5%. 

Note that the input states used here give minimum canonical phase variance, but as the phase 
variances obtained for these feedback schemes differ from the canonical phase variance (above 2 photons),
these input states are not necessarily optimum for these measurements. Therefore numerical
optimisations were performed where both the input state and the feedback were optimised for, and
the results are also shown in Fig. 6.10.

It was found that the phase variance was reduced below the results where only the feedback was

Figure 6.10: The phase variance for the feedback scheme of Sec. 6.3 and two numerically optimised
feedback schemes as ratios to the minimum intrinsic phase variance. The variance for the feedback
scheme of Sec. 6.3 is shown as the crosses, the case where the feedback alone is numerically optimised
is shown as circles, and the case where both the state and the feedback are numerically optimised is
shown as pluses.



optimised for, even for the case with 3 photons. In the case with 3 photons, the improvement is only 
about 0.0005%, which is not visible on the graph. The improvements for the larger photon numbers 
are slightly larger, almost 1% for the largest photon number calculations have been performed for. 
These improvements are still very minor, and much less than the improvements obtained by solving 
for the feedback. 



For 3 or 4 photons it was found that the feedback scheme of Sec. 6.3 is still optimum for the
numerically optimised states. For more general states, however, this is not the case. For example,
for the 3 photon state given by

$$ |\psi\rangle = \frac{1}{\sqrt{3}} \left( |3/2, -3/2\rangle_z + |3/2, 1/2\rangle_z + |3/2, 3/2\rangle_z \right), \tag{6.60} $$

the variation of the phase variance with the second feedback phase is as given in Fig. 6.11. The
phase variance here is much smaller when the feedback phase is altered by $\pi$. In fact, the variance
is a maximum when this feedback phase is at the value given by the feedback scheme of Sec. 6.3.
Not only this, but the variance is less than the canonical variance for the entire range.

This is a strange result, as we would normally expect that it is impossible for the phase variance 
for the measurements to be smaller than the canonical phase variance. To understand the reason 



for this, consider the expression for $\langle e^{i\phi} \rangle$. From Eq. (1.52) of the introduction, for measurements of
a single mode we have

$$ \langle e^{i\phi} \rangle = \sum_{n=0}^{\infty} \langle \psi | n \rangle \langle n+1 | \psi \rangle H_{n,n+1}. \tag{6.61} $$

The corresponding expression for interferometric measurements is

$$ \langle e^{i\phi} \rangle = \sum_{\mu=-j}^{j-1} \langle \psi | j\mu \rangle_y \, {}_y\langle j,\mu+1 | \psi \rangle H_{\mu,\mu+1}. \tag{6.62} $$

Figure 6.11: The phase variance for the 3 photon input state of Eq. (6.60) as a function of the second
feedback phase. The phase given here is relative to the second feedback phase given by the feedback
scheme of Sec. 6.3. The other feedback phases are as given by this feedback scheme. The continuous
line is the canonical phase variance, and the dotted line is the variance for the measurements.



For canonical measurements all of the $H_{\mu,\mu+1}$ are equal to one. We would normally expect that
any smaller value of $H_{\mu,\mu+1}$ would lead to a smaller value of $|\langle e^{i\phi} \rangle|$ (and a larger Holevo phase
variance). This is not necessarily the case, however. For example, if all the values of ${}_y\langle j\mu|\psi\rangle$ are
positive except for one negative value for $\mu = \mu_1$, then $\langle \psi | j\mu_1 \rangle_y \, {}_y\langle j,\mu_1+1 | \psi \rangle$ will be negative. In
that case, it is obvious that a smaller value of $H_{\mu_1,\mu_1+1}$ will lead to a smaller variance.

In general, the POM with $H_{nm} = 1$ gives the smallest variance only for states where the phases
of ${}_y\langle j\mu|\psi\rangle$ vary as

$$ \arg {}_y\langle j\mu|\psi\rangle = \mu\phi + \phi_0. \tag{6.63} $$

As is discussed in Ref. |1S|| , if the phases of ${}_y\langle j\mu|\psi\rangle$ have some more arbitrary variation

$$ \arg {}_y\langle j\mu|\psi\rangle = \phi_\mu, \tag{6.64} $$

then the optimum POM (for the error in the phase) is given by

m = ^~ E e^-^\jy) y { JV \e^-^. (6.65) 

I will call this the corrected canonical POM, to avoid confusion with the optimum measurements 
based on feedback. With this POM we find 

$$ |\langle e^{i\phi} \rangle| = \sum_{\mu=-j}^{j-1} \left| \langle \psi | j\mu \rangle_y \, {}_y\langle j,\mu+1 | \psi \rangle \right|. \tag{6.66} $$

Using this expression, the variance for the state of Eq. (6.60) is approximately 0.6863, which is much
smaller than the variance for the measurements. This is possible because, although the ${}_z\langle j\mu|\psi\rangle$ are
positive, the ${}_y\langle j\mu|\psi\rangle$ are complex.

For most of the states considered in this study, the phases of ${}_y\langle j\mu|\psi\rangle$ vary linearly as in Eq. (6.63),
so using the corrected canonical POM does not change the results. The only exceptions [apart from
the state of Eq. (6.60)] are:

1. The state $(|j0\rangle_z + |j1\rangle_z)/\sqrt{2}$. This has a slightly smaller variance for corrected canonical
measurements than for the usual canonical measurements. The results are not qualitatively changed,
however, as the phase variance still remains on the order of 1 for very large photon numbers, due to
the large peak at $\pm\pi$. This state is therefore still too poor to be useful.

2. Arbitrary 2 photon states. For any 2 photon state where the phases of ${}_y\langle j\mu|\psi\rangle$ do not vary
linearly, although the feedback scheme of Sec. 6.3 gives a variance equal to the canonical variance, it
does not give a variance as small as that for corrected canonical measurements. For other 2 photon
states and for 1 photon states there is no change.



Chapter 7 

Continuous Phase Measurements 



Now I will consider the case of continuous adaptive measurements, where we are continuously mea- 
suring a phase that is varying. This case is closer to what is usually done in practice, where rather 
than measuring a single, fixed phase of a pulse, a signal is transmitted in the varying phase of a 
continuous beam, and we wish to measure that varying phase. 



7.1 Continuous Dyne Measurements on Coherent States 

Firstly I will consider the case of continuous measurements on a single mode field. For this case it 
is simplest to consider coherent states. For continuous coherent states, the coherent amplitude $\alpha$
has a constant magnitude, but varying phase. As in the single-shot case, a quadrature of the field is
measured by combining the mode to be measured with a large amplitude local oscillator field that
is treated classically. The two fields at the two output ports of the beam splitter are given by

$$ b_\pm(t) = \frac{1}{\sqrt{2}} (a \pm \gamma), \tag{7.1} $$

where $a$ is the operator for the mode to be measured and $\gamma$ is the amplitude of the local oscillator
field. The instantaneous rate of photodetection at each photodetector is

$$ \langle b_\pm^\dagger b_\pm \rangle = \frac{1}{2} \left( |\alpha|^2 + |\gamma|^2 \pm 2\,{\rm Re}(\alpha\gamma^*) \right). \tag{7.2} $$

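The signal of interest is the difference between the two photocurrents at the two detectors. The
number of photocounts at each of the detectors in the time interval $[t, t + \delta t)$ will be denoted by
$\delta N_\pm(t)$. I will use the usual definition of the signal photocurrent

$$ I(t) = \lim_{\delta t \to 0} \lim_{|\gamma| \to \infty} \frac{\delta N_+(t) - \delta N_-(t)}{|\gamma|\,\delta t}. \tag{7.3} $$

The expectation values of the increments $\delta N_\pm(t)$ in the infinitesimal limit are

$$ \begin{aligned} \langle dN_\pm(t) \rangle &= \langle b_\pm^\dagger(t) b_\pm(t) \rangle\, dt \\ \langle dN_\pm^2(t) \rangle &= \langle dN_\pm(t) \rangle. \end{aligned} \tag{7.4} $$

For large $|\gamma|$, $\delta N_\pm(t)$ can be approximated by the Gaussian increments $\delta W_\pm(t)$:

$$ \delta N_\pm(t) \approx K_\pm\, \delta t + \sqrt{K_\pm}\, \delta W_\pm(t), \tag{7.5} $$

where

$$ K_\pm = \frac{1}{2} \left( |\alpha|^2 + |\gamma|^2 \pm 2\,{\rm Re}(\alpha\gamma^*) \right). \tag{7.6} $$

Using this in the definition of the photocurrent (7.3) gives

$$ \begin{aligned} I(t) &= \lim_{\delta t \to 0} \lim_{|\gamma| \to \infty} \frac{\delta N_+(t) - \delta N_-(t)}{|\gamma|\,\delta t} \\ &= \lim_{\delta t \to 0} \frac{2\,{\rm Re}\left(\alpha e^{-i\Phi(t)}\right) \delta t + \frac{1}{\sqrt{2}} \left( \delta W_+(t) - \delta W_-(t) \right)}{\delta t} \\ &= 2\,{\rm Re}\left(\alpha e^{-i\Phi(t)}\right) + \frac{dW(t)}{dt}, \end{aligned} \tag{7.7} $$

where $\Phi(t)$ is the phase of $\gamma$. The result is therefore

$$ I(t)dt = 2\,{\rm Re}\left(\alpha e^{-i\Phi(t)}\right) dt + dW(t). \tag{7.8} $$

This is identical to the result in the case of single-shot measurements, except in this case the time
is not scaled to the unit interval.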

In making adaptive phase measurements the phase of the local oscillator is usually taken to be

$$ \Phi(t) = \hat\varphi(t) + \pi/2, \tag{7.9} $$

where $\hat\varphi(t)$ is some estimate of the system phase $\varphi(t)$. The only case where the feedback phase is not
based on this (for dyne measurements) is for the corrections to the close-to-optimal measurements
considered in Ch. ||. With this, the signal becomes

$$ I(t)dt = 2|\alpha| \sin\left(\varphi(t) - \hat\varphi(t)\right) dt + dW(t). \tag{7.10} $$

7.1.1 Linear Approximation 

Provided that the estimated system phase is sufficiently close to the actual system phase, we can 
make the linear approximation 

$$ I(t)dt = 2|\alpha| \left(\varphi(t) - \hat\varphi(t)\right) dt + dW(t). \tag{7.11} $$

Rearranging this equation gives 

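$$ \begin{aligned} 2|\alpha|\,\hat\varphi(t)dt + I(t)dt &= 2|\alpha|\,\varphi(t)dt + dW(t) \\ \hat\varphi(t) + \frac{I(t)}{2|\alpha|} &= \varphi(t) + \frac{dW(t)}{2|\alpha|\,dt}, \end{aligned} \tag{7.12} $$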

which gives 

W) + ^)=^)- (7.13) 

Therefore, under the linear approximation, each data point I(t) can be used to obtain an independent 
estimate of the phase. 

Now I will denote the best phase estimate based on all the data up to time $t$ by $\Theta(t)$. I will also
denote the best phase estimate based on the data in the infinitesimal time interval $[t, t + dt)$ by $\theta(t)$.
Note that these are the best phase estimates, in contrast to the phase estimate used in the feedback,
$\hat\varphi(t)$. The phase estimates $\theta(t)$ are given by

$$ \theta(t) = \hat\varphi(t) + \frac{I(t)}{2|\alpha|}, \tag{7.14} $$

which has the expectation value $\varphi(t)$ and the variance

$$ \left\langle \left( \theta(t) - \varphi(t) \right)^2 \right\rangle = \left\langle \left( \frac{dW(t)}{2|\alpha|\,dt} \right)^2 \right\rangle = \frac{1}{4|\alpha|^2 dt}. \tag{7.15} $$

Here the simple definition of the variance has been used, rather than the Holevo phase variance,
because we are using the linear approximation.

If we were considering a measurement over the time interval $[0, 1]$, during which there are $\bar{n} = |\alpha|^2$
photons from the signal mode, then we would be effectively averaging over $1/dt$ estimates of the
phase, each with a variance given by Eq. (7.15). As usual with averages, the variance in the mean is
the variance of each data point divided by the number of data points. This means that the variance
in the final phase estimate is

$$ \left\langle (\Theta - \varphi)^2 \right\rangle = \frac{1}{1/dt}\, \frac{1}{4|\alpha|^2 dt} = \frac{1}{4|\alpha|^2} = \frac{1}{4\bar{n}}, \tag{7.16} $$

which is the standard result for a coherent state under adaptive phase measurements.
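The averaging argument leading to Eq. (7.16) can be checked numerically. The following is a minimal sketch under the linear approximation (7.11), not part of the original analysis: the function name, the step size, the number of trials, and the particular values of the system phase and feedback phase are all illustrative assumptions.

```python
import math
import random


def averaged_estimate_variance(alpha_mag, dt=0.01, trials=4000, seed=1):
    """Sample variance of the mean of the estimates theta(t) over [0, 1]."""
    rng = random.Random(seed)
    steps = round(1.0 / dt)
    phi = 0.3       # fixed system phase (arbitrary illustrative value)
    phi_hat = 0.1   # feedback phase estimate (cancels under the linear approximation)
    errors = []
    for _ in range(trials):
        total = 0.0
        for _ in range(steps):
            dW = rng.gauss(0.0, math.sqrt(dt))
            # Linearised photocurrent increment, Eq. (7.11)
            I_dt = 2.0 * alpha_mag * (phi - phi_hat) * dt + dW
            # Phase estimate from this time interval, Eq. (7.14), with I(t) = I_dt / dt
            theta = phi_hat + I_dt / (2.0 * alpha_mag * dt)
            total += theta
        errors.append(total / steps - phi)
    mean = sum(errors) / trials
    return sum((e - mean) ** 2 for e in errors) / (trials - 1)


if __name__ == "__main__":
    alpha_mag = 2.0
    print("simulated:", averaged_estimate_variance(alpha_mag))
    print("Eq. (7.16):", 1.0 / (4.0 * alpha_mag ** 2))
```

Each individual estimate has the large variance $1/(4|\alpha|^2 dt)$, but the mean of the $1/dt$ estimates should agree with $1/(4|\alpha|^2)$ to within sampling error.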

In this case we want to consider a measurement that is continued indefinitely. If the system 
phase is not varied, then the variance in the phase estimate will just go down indefinitely. In that 
case, however, no information is transmitted beyond a single real number. What we wish to do is 
vary the system phase in order to transmit information, as is the case for FM radio. 

Next is the question of how the system phase $\varphi(t)$ will be varied. The simplest way of varying the
system phase is to vary it stochastically via Wiener increments. I will therefore take the variation
in the system phase to be

$$ \varphi(t + dt) = \varphi(t) + \kappa\, dW'(t). \tag{7.17} $$

This Wiener increment is independent from that used previously for the photocurrent, as indicated
by the prime.

It is clear that the most recent data will be least affected by the variation in the system phase,
and the data from further and further back will be less and less accurate. To quantify this, note
that when the current system phase $\varphi(t)$ is used as a reference, the variance in the system phase at
some previous time $t'$ is $\kappa^2(t - t')$. As the phase estimate $\theta(t')$ based on the time interval $[t', t' + dt')$
is an estimate of the system phase at time $t'$, when it is considered relative to the current system
phase its variance will be increased by $\kappa^2(t - t')$. Therefore the total variance in each individual
phase estimate $\theta(t')$ is

$$ \frac{1}{4|\alpha|^2 dt} + \kappa^2 (t - t'). \tag{7.18} $$



In order to determine the current best phase estimate $\Theta(t)$, we would like to form a weighted
average of each of the individual phase estimates $\theta(t')$. If this is done in the usual way for weighted
averages, using the variance given by (7.18), the results obtained are ridiculous, as the contribution
to the variance from the variation in the system phase is infinitesimal as compared to the quantum
noise. The problem is that the variation in the system phase makes the errors in the phase estimates
$\theta(t')$ correlated, so the contribution from the variation in the system phase is infinitesimal over
infinitesimal time intervals, but becomes significant over finite time intervals. This means that the
usual method of performing weighted averages will not work.

As an alternative approach, we can consider just the weighting of the latest phase estimate $\theta(t)$
as compared to the phase estimate from all the previous data $\Theta(t)$. In this case $\theta(t)$ has no variance
from the variation in the system phase, and so it is uncorrelated with the previous phase estimate
and we can use a weighted average in the usual way.

The equilibrium value of the variance of $\Theta(t)$, with all the individual phase estimates correctly
weighted, will be denoted by $\Delta\Theta^2$. After a time $dt$ the phase variance of $\Theta(t)$ with respect to the
new system phase $\varphi(t + dt)$, i.e.

$$ \left\langle \left( \Theta(t) - \varphi(t + dt) \right)^2 \right\rangle, \tag{7.19} $$

will be $\Delta\Theta^2 + \kappa^2 dt$. The variance in the phase estimate from the latest time interval, $\theta(t)$, will be
given by Eq. (7.15). If we take a weighted average of $\Theta(t)$ and $\theta(t)$, then the contributions from
each of the phase estimates from the individual time intervals should be correctly weighted, and the
variance in the weighted average should be the equilibrium value, $\Delta\Theta^2$. This implies that

$$ \frac{1}{\Delta\Theta^2 + \kappa^2 dt} + 4|\alpha|^2 dt = \frac{1}{\Delta\Theta^2}. \tag{7.20} $$

This expression can be used to determine the equilibrium value of the phase variance. Solving for
$\Delta\Theta^2$ gives

$$ \Delta\Theta^2 = \frac{\kappa}{2|\alpha|}. \tag{7.21} $$

Thus we find that for continuous measurements the phase variance scales as $1/|\alpha|$ rather than $1/|\alpha|^2$.
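The equilibrium condition (7.20) can also be iterated numerically as a variance update. The sketch below is an illustration only: the function name, the finite step size `dt`, the starting variance and the parameter values are assumptions, and a finite `dt` introduces a correction of order $\kappa^2 dt$ relative to Eq. (7.21).

```python
def equilibrium_variance(alpha_mag, kappa, dt=1e-4, t_final=5.0, v0=1.0):
    """Iterate the inverse-variance update of Eq. (7.20) until it settles."""
    v = v0
    for _ in range(round(t_final / dt)):
        # Variance of the old estimate relative to the new system phase
        v_prior = v + kappa ** 2 * dt
        # Inverse variances add when combining with the latest estimate,
        # whose variance is 1 / (4 |alpha|^2 dt) from Eq. (7.15)
        v = 1.0 / (1.0 / v_prior + 4.0 * alpha_mag ** 2 * dt)
    return v


if __name__ == "__main__":
    alpha_mag, kappa = 5.0, 1.0
    print("iterated:", equilibrium_variance(alpha_mag, kappa))
    print("Eq. (7.21):", kappa / (2.0 * alpha_mag))
```

The iteration forgets its starting value on a time scale of order $1/(4|\alpha|\kappa)$, so the result should not depend on `v0` once `t_final` is much longer than this.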
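Showing explicitly how the weighted average is performed,

$$ \Theta(t + dt) = \frac{(4|\alpha|^2 dt)\, \theta(t) + \dfrac{1}{\Delta\Theta^2 + \kappa^2 dt}\, \Theta(t)}{1/\Delta\Theta^2}. \tag{7.22} $$

Simplifying this gives

$$ \Theta(t + dt) = (2|\alpha|\kappa\, dt)\, \theta(t) + (1 - 2|\alpha|\kappa\, dt)\, \Theta(t). \tag{7.23} $$

In terms of the increment in the phase estimate this is

$$ d\Theta(t) + (2|\alpha|\kappa\, dt)\, \Theta(t) = (2|\alpha|\kappa\, dt)\, \theta(t). \tag{7.24} $$

Solving this we find that

$$ \begin{aligned} d\Theta(t)\, e^{2|\alpha|\kappa t} + (2|\alpha|\kappa\, dt)\, \Theta(t)\, e^{2|\alpha|\kappa t} &= (2|\alpha|\kappa\, dt)\, \theta(t)\, e^{2|\alpha|\kappa t} \\ d\left( \Theta(t)\, e^{2|\alpha|\kappa t} \right) &= (2|\alpha|\kappa)\, \theta(t)\, e^{2|\alpha|\kappa t}\, dt \\ \Theta(t) &= 2|\alpha|\kappa \int_{-\infty}^{t} \theta(s)\, e^{2|\alpha|\kappa (s - t)}\, ds. \end{aligned} \tag{7.25} $$

Therefore this method corresponds to a simple negative exponential scaling of the weighting.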
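We can also consider a more general negative exponential scaling given by

$$ \Theta(t) = \chi \int_{-\infty}^{t} \theta(s)\, e^{\chi (s - t)}\, ds. \tag{7.26} $$

Note that with this more general scaling, $\Theta(t)$ is no longer necessarily the best phase estimate. For
most of the remainder of this chapter, $\Theta(t)$ will be used in this more general sense, rather than as
specifically the best phase estimate. The best phase estimate will be found by finding the optimum
value of $\chi$. Taking the derivative of this expression with respect to time gives

$$ \begin{aligned} d\left( \Theta(t)\, e^{\chi t} \right) &= \chi\, \theta(t)\, e^{\chi t}\, dt \\ d\Theta(t) &= \chi\, dt \left( \theta(t) - \Theta(t) \right) \\ \Theta(t + dt) &= \chi\, dt\, \theta(t) + (1 - \chi\, dt)\, \Theta(t). \end{aligned} \tag{7.27} $$

This means that this method is again a weighted average, except with a weighting that is not
optimum. If we find the variance of both sides of this equation we obtain

$$ \begin{aligned} {\rm var}\left( \Theta(t + dt) \right) &= (\chi\, dt)^2\, {\rm var}\left( \theta(t) \right) + (1 - \chi\, dt)^2\, {\rm var}\left( \Theta(t) \right) \\ \Delta\Theta^2 &= \frac{(\chi\, dt)^2}{4|\alpha|^2 dt} + (1 - 2\chi\, dt) \left( \Delta\Theta^2 + \kappa^2 dt \right) \\ \Delta\Theta^2 &= \frac{\chi^2 dt}{4|\alpha|^2} + \Delta\Theta^2 + \kappa^2 dt - 2\chi\, dt\, \Delta\Theta^2. \end{aligned} \tag{7.28} $$

Solving for $\Delta\Theta^2$ gives

$$ \Delta\Theta^2 = \frac{\chi}{8|\alpha|^2} + \frac{\kappa^2}{2\chi}. \tag{7.29} $$

The optimum value of $\chi$ can be verified from this equation. Taking the derivative with respect to $\chi$
gives

$$ \frac{d}{d\chi} \left( \Delta\Theta^2 \right) = \frac{1}{8|\alpha|^2} - \frac{\kappa^2}{2\chi^2}. \tag{7.30} $$

For the variance to be minimised this must be zero, so

$$ \begin{aligned} \frac{1}{8|\alpha|^2} &= \frac{\kappa^2}{2\chi^2} \\ \chi &= 2|\alpha|\kappa. \end{aligned} \tag{7.31} $$

This is the exponential constant which was found directly. Substituting this value of $\chi$ into the
expression for the variance gives

$$ \Delta\Theta^2 = \frac{2|\alpha|\kappa}{8|\alpha|^2} + \frac{\kappa^2}{4|\alpha|\kappa} = \frac{\kappa}{2|\alpha|}, \tag{7.32} $$

which is the result found for the phase variance in Eq. (7.21).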
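As a numerical cross-check of Eqs. (7.28) to (7.31), the variance recursion for a general weighting constant can be iterated to its fixed point and compared with the closed form. This is a hedged sketch: the function names, the finite step `dt`, the search grid and the parameter values are illustrative assumptions, and the finite step gives a small $O(dt)$ discrepancy.

```python
def variance_formula(chi, alpha_mag, kappa):
    """Closed-form equilibrium variance, Eq. (7.29)."""
    return chi / (8.0 * alpha_mag ** 2) + kappa ** 2 / (2.0 * chi)


def variance_by_iteration(chi, alpha_mag, kappa, dt=1e-4, t_final=5.0):
    """Iterate the first line of Eq. (7.28) with a finite time step."""
    v = 0.0
    for _ in range(round(t_final / dt)):
        var_theta = 1.0 / (4.0 * alpha_mag ** 2 * dt)   # Eq. (7.15)
        var_old = v + kappa ** 2 * dt                   # drift of the system phase
        v = (chi * dt) ** 2 * var_theta + (1.0 - chi * dt) ** 2 * var_old
    return v


def best_chi_on_grid(alpha_mag, kappa, step=0.05, n=1000):
    """Grid search for the weighting constant minimising Eq. (7.29)."""
    grid = [step * i for i in range(1, n + 1)]
    return min(grid, key=lambda c: variance_formula(c, alpha_mag, kappa))


if __name__ == "__main__":
    alpha_mag, kappa = 5.0, 1.0
    for chi in (4.0, 10.0, 25.0):
        print(chi, variance_by_iteration(chi, alpha_mag, kappa),
              variance_formula(chi, alpha_mag, kappa))
    print("best chi on grid:", best_chi_on_grid(alpha_mag, kappa),
          "expected:", 2.0 * alpha_mag * kappa)
```

For these assumed values the grid minimum should fall at $\chi = 2|\alpha|\kappa = 10$, with the variance rising on either side as the weighting becomes too short-lived or too long-lived.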



7.1.2 Exact Case 

The results of the previous section are all using the linear approximation (7.11). Although this
approximation is very useful for obtaining the asymptotic value of the variance, it does not directly
tell us what to do in the exact case. In the exact case for single-shot measurements (see Sec. 1.5),
rather than forming independent phase estimates from each time interval $dt$ and then averaging
them, we determine $A_v$ and $B_v$, and the phase estimate is given by

$$ \Theta(v) = \arg\left( v A_v + B_v A_v^* \right). \tag{7.33} $$

Therefore the average phase estimate with exponential weighting does not make much sense in this
case. In addition the intermediate phase estimate must be considered. An alternative approach is to
use an exponential weighting in determining $A_v$ and $B_v$, and then use these to determine the phase
estimate. Specifically, I will replace the definitions of $A_v$ and $B_v$,

$$ \begin{aligned} A_v &= \int_0^v e^{i\Phi} I(u)\, du \\ B_v &= -\int_0^v e^{2i\Phi}\, du, \end{aligned} \tag{7.34} $$

by

$$ \begin{aligned} A_t &= \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\Phi} I(u)\, du \\ B_t &= -\int_{-\infty}^{t} e^{\chi(u - t)} e^{2i\Phi}\, du. \end{aligned} \tag{7.35} $$

Then $\arg A_t$ can still be used as the intermediate phase estimate. I will not consider any better
intermediate phase estimates here, as these only give very small improvements over the mark II case
for coherent states. To find a phase estimate to use for $\Theta(t)$, we can use a similar approach to that
used in Ref. p5|. If the system phase is constant, then we find

$$ \begin{aligned} A_t &= \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\Phi} I(u)\, du \\ &= \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\Phi} \left[ \left( \alpha e^{-i\Phi} + \alpha^* e^{i\Phi} \right) du + dW(u) \right] \\ &= \alpha \int_{-\infty}^{t} e^{\chi(u - t)}\, du + \alpha^* \int_{-\infty}^{t} e^{\chi(u - t)} e^{2i\Phi}\, du + \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\Phi}\, dW(u) \\ &= \frac{\alpha}{\chi} - \alpha^* B_t + i\sigma_t, \end{aligned} \tag{7.36} $$

where

$$ \sigma_t = \int_{-\infty}^{t} e^{\chi(u - t)} e^{i(\Phi - \pi/2)}\, dW(u). \tag{7.37} $$

This result is analogous to the result (1.77) for the case of single-shot measurements, except with $v$
replaced with $1/\chi$. Note that from this derivation it naturally emerges that we should use the same
exponential scaling for $B_t$ as for $A_t$. From Eq. (7.36) it can be shown that



$$ A_t + \chi B_t A_t^* = \alpha \left( \frac{1}{\chi} - \chi |B_t|^2 \right) + i\sigma_t - i\chi B_t \sigma_t^*. \tag{7.38} $$

This means that

$$ \left\langle A_t + \chi B_t A_t^* \right\rangle \approx \alpha \left( \frac{1}{\chi} - \chi |B_t|^2 \right). \tag{7.39} $$

Similarly to the single-shot case this is not necessarily exact if the local oscillator phase is dependent
on the measurement record, but it should still be approximately true. Therefore the phase estimate
that will be used here is

$$ \Theta(t) = \arg\left( A_t + \chi B_t A_t^* \right). \tag{7.40} $$

Note that the factor of $\chi$ makes sense, as the time over which the data is used is effectively $1/\chi$.
Similarly to the single-shot case, I will define the variable $C_t = A_t + \chi B_t A_t^*$, so $\Theta(t) = \arg C_t$. The
above derivation is not exact if the system phase is not constant; however, $\arg C_t$ should still be a
good estimator for the phase.

A differential equation for the feedback phase can be determined in a similar way as in Ref. |f}5| . 
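Using Eq. (7.35), we can determine the increment in $A_t$:

$$ \begin{aligned} e^{\chi t} A_t &= \int_{-\infty}^{t} e^{\chi u} e^{i\Phi} I(u)\, du \\ d\left( e^{\chi t} A_t \right) &= e^{\chi t} e^{i\Phi} I(t)\, dt \\ dA_t &= e^{i\Phi} I(t)\, dt - \chi A_t\, dt. \end{aligned} \tag{7.41} $$

Taking the local oscillator phase to be

$$ \Phi(t) = \arg A_t + \frac{\pi}{2}, \tag{7.42} $$

as in the case of mark II measurements, we find that

$$ dA_t = i \frac{A_t}{|A_t|} I(t)\, dt - \chi A_t\, dt. \tag{7.43} $$

So the magnitude of $A_t$ varies as

$$ \begin{aligned} d|A_t|^2 &= A_t^* (dA_t) + (dA_t^*) A_t + (dA_t^*)(dA_t) \\ &= A_t^* \left( i \frac{A_t}{|A_t|} I(t)\, dt - \chi A_t\, dt \right) + \left( -i \frac{A_t^*}{|A_t|} I(t)\, dt - \chi A_t^*\, dt \right) A_t + dt \\ &= \left( 1 - 2\chi |A_t|^2 \right) dt. \end{aligned} \tag{7.44} $$

This demonstrates that, rather than increasing linearly as in the standard case, $|A_t|$ increases up to
an equilibrium value given by

$$ |A_t|^2 = \frac{1}{2\chi}. \tag{7.45} $$

Using this result, the increment in the feedback phase is

$$ \begin{aligned} d\hat\varphi(t) &= {\rm Im}\left[ d \ln A_t \right] \\ &= {\rm Im}\left[ \frac{dA_t}{A_t} - \frac{(dA_t)^2}{2A_t^2} \right] \\ &= {\rm Im}\left[ \frac{i}{|A_t|} I(t)\, dt - \chi\, dt + \frac{dt}{2|A_t|^2} \right] \\ &= \frac{I(t)\, dt}{|A_t|} \\ &= \sqrt{2\chi}\, I(t)\, dt. \end{aligned} \tag{7.46} $$

Therefore the feedback for this case is actually much simpler than for the single-shot case. The
feedback phase just changes linearly with the signal, and there is no $\sqrt{t}$ scaling as there is in the
standard case.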
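The feedback loop defined by Eqs. (7.8), (7.17), (7.41) and (7.42) is straightforward to simulate with a simple Euler scheme, which gives a direct check of the equilibrium value (7.45). The sketch below is illustrative only: the function name, step size, run length, random seed and parameter values are assumptions, and the time average is taken after an arbitrary burn-in period.

```python
import cmath
import math
import random


def mean_square_amplitude(alpha_mag=3.0, kappa=0.5, chi=3.0,
                          dt=1e-3, t_final=60.0, seed=7):
    """Time-averaged |A_t|^2 for the exponentially weighted feedback loop."""
    rng = random.Random(seed)
    sqrt_dt = math.sqrt(dt)
    steps = round(t_final / dt)
    burn_in = steps // 6          # discard the initial transient
    phi = 0.0                     # system phase
    A = 0j                        # exponentially weighted A_t
    total = 0.0
    for k in range(steps):
        # Local oscillator phase, Eq. (7.42); cmath.phase(0j) returns 0.0
        Phi = cmath.phase(A) + math.pi / 2.0
        dW = rng.gauss(0.0, sqrt_dt)
        # Photocurrent increment, Eq. (7.8), with alpha = |alpha| exp(i phi)
        I_dt = 2.0 * alpha_mag * math.cos(phi - Phi) * dt + dW
        # Increment in A_t, Eq. (7.41)
        A += cmath.exp(1j * Phi) * I_dt - chi * A * dt
        # Diffusion of the system phase, Eq. (7.17), with an independent increment
        phi += kappa * rng.gauss(0.0, sqrt_dt)
        if k >= burn_in:
            total += abs(A) ** 2
    return total / (steps - burn_in)


if __name__ == "__main__":
    chi = 3.0
    print("simulated <|A_t|^2>:", mean_square_amplitude(chi=chi))
    print("Eq. (7.45):", 1.0 / (2.0 * chi))
```

Because the feedback makes each increment of $A_t$ orthogonal to $A_t$ itself, $|A_t|^2$ is driven only by $(I\,dt)^2 \approx dt$ and by the decay term, which is why it settles near $1/(2\chi)$ regardless of the signal amplitude.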

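Using this result gives the stochastic differential equation for the phase estimate $\hat\varphi(t)$ as

$$ \begin{aligned} d\hat\varphi(t) &= \sqrt{2\chi} \left[ I(t)\, dt \right] \\ &= \sqrt{2\chi} \left[ 2|\alpha| \sin\left( \varphi(t) - \hat\varphi(t) \right) dt + dW(t) \right]. \end{aligned} \tag{7.47} $$

Unlike the standard case, no change of variables is required, as there is no factor of $\sqrt{t}$. Making a
linear approximation gives

$$ d\hat\varphi(t) = \sqrt{2\chi} \left[ 2|\alpha| \left( \varphi(t) - \hat\varphi(t) \right) dt + dW(t) \right]. $$

Rearranging this gives

$$ \begin{aligned} d\hat\varphi(t) + 2|\alpha|\sqrt{2\chi}\, \hat\varphi(t)\, dt &= 2|\alpha|\sqrt{2\chi}\, \varphi(t)\, dt + \sqrt{2\chi}\, dW(t) \\ d\left[ \hat\varphi(t)\, e^{2|\alpha|\sqrt{2\chi}\, t} \right] &= e^{2|\alpha|\sqrt{2\chi}\, t} \left[ 2|\alpha|\sqrt{2\chi}\, \varphi(t)\, dt + \sqrt{2\chi}\, dW(t) \right]. \end{aligned} \tag{7.48} $$

Integrating then gives the solution as

$$ \hat\varphi(t) = \sqrt{2\chi} \int_{-\infty}^{t} e^{2|\alpha|\sqrt{2\chi}(u - t)} \left[ 2|\alpha|\, \varphi(u)\, du + dW(u) \right]. \tag{7.49} $$

If the phase is measured relative to the current system phase, then

$$ \varphi(u) = -\kappa \int_u^t dW'(s). \tag{7.50} $$

Using this, the solution for the phase estimate is

$$ \hat\varphi(t) = \sqrt{2\chi} \int_{-\infty}^{t} e^{2|\alpha|\sqrt{2\chi}(u - t)} \left[ dW(u) - 2|\alpha|\kappa \int_u^t dW'(s)\, du \right]. \tag{7.51} $$

The variance in this phase estimate will be

$$ \begin{aligned} \left\langle \hat\varphi^2(t) \right\rangle &= 2\chi \left\langle \int_{-\infty}^{t} du_1 \int_{-\infty}^{t} du_2\, e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - 2t)}\, 4|\alpha|^2\kappa^2 \int_{u_1}^{t} \int_{u_2}^{t} dW'(s_2)\, dW'(s_1) \right\rangle \\ &\quad + 2\chi \left\langle \int_{-\infty}^{t} \int_{-\infty}^{t} e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - 2t)}\, dW(u_2)\, dW(u_1) \right\rangle \\ &= 8\chi|\alpha|^2\kappa^2 \int_{-\infty}^{t} du_1 \int_{-\infty}^{t} du_2\, e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - 2t)} \int_{\max(u_1,u_2)}^{t} ds + 2\chi \int_{-\infty}^{t} e^{4|\alpha|\sqrt{2\chi}(u - t)}\, du \\ &= 8\chi|\alpha|^2\kappa^2 \int_{-\infty}^{t} du_1 \int_{-\infty}^{u_1} du_2\, e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - 2t)} \int_{u_1}^{t} ds \\ &\quad + 8\chi|\alpha|^2\kappa^2 \int_{-\infty}^{t} du_1 \int_{u_1}^{t} du_2\, e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - 2t)} \int_{u_2}^{t} ds + 2\chi \left[ \frac{e^{4|\alpha|\sqrt{2\chi}(u - t)}}{4|\alpha|\sqrt{2\chi}} \right]_{-\infty}^{t} \\ &= 8\chi|\alpha|^2\kappa^2 \int_{-\infty}^{t} du_1 \int_{-\infty}^{u_1} e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - 2t)} (t - u_1)\, du_2 \\ &\quad + 8\chi|\alpha|^2\kappa^2 \int_{-\infty}^{t} du_1 \int_{u_1}^{t} e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - 2t)} (t - u_2)\, du_2 + \frac{\sqrt{2\chi}}{4|\alpha|} \\ &= 16\chi|\alpha|^2\kappa^2 \int_{-\infty}^{t} du_1 \int_{-\infty}^{u_1} e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - 2t)} (t - u_1)\, du_2 + \frac{\sqrt{2\chi}}{4|\alpha|} \\ &= \frac{16\chi|\alpha|^2\kappa^2}{2|\alpha|\sqrt{2\chi}} \int_{-\infty}^{t} e^{4|\alpha|\sqrt{2\chi}(u_1 - t)} (t - u_1)\, du_1 + \frac{\sqrt{2\chi}}{4|\alpha|} \\ &= \frac{16\chi|\alpha|^2\kappa^2}{2|\alpha|\sqrt{2\chi} \left( 4|\alpha|\sqrt{2\chi} \right)^2} + \frac{\sqrt{2\chi}}{4|\alpha|} \\ &= \frac{\kappa^2}{4|\alpha|\sqrt{2\chi}} + \frac{\sqrt{2\chi}}{4|\alpha|}. \end{aligned} \tag{7.52} $$

This result for the variance of the intermediate phase estimate is quite different from that for the
variance of $\Theta(t)$ given in Eq. (7.29). Of particular interest is the fact that the contribution due to
the variation in the system phase is not simply $\kappa^2/(2\chi)$, as is usually the case.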

At first it would seem that the method used to find the mark II phase variance in |55| would be
applicable here also. It turns out that this method is not useable, because when the system phase
varies it is not possible to separate the different terms as in Eq. (7.36). A more promising way is to
use a method similar to that in Eq. (7.52). The phase estimate can be simplified to

$$ \begin{aligned} \Theta(t) &= \arg\left( A_t + \chi B_t A_t^* \right) \\ &= \arg A_t + \arg\left( 1 + \chi B_t A_t^* / A_t \right) \\ &= \hat\varphi(t) + \arg\left( 1 + \chi e^{-2i\hat\varphi(t)} B_t \right). \end{aligned} \tag{7.53} $$

Expressing $B_t$ as an integral gives

$$ \Theta(t) = \hat\varphi(t) + \arg\left( 1 + \chi e^{-2i\hat\varphi(t)} \int_{-\infty}^{t} e^{\chi(u - t)} e^{2i\hat\varphi(u)}\, du \right). \tag{7.54} $$

Expanding the exponentials to first order we get

$$ \begin{aligned} \Theta(t) &\approx \hat\varphi(t) + \arg\left( 1 + \chi \left( 1 - 2i\hat\varphi(t) \right) \int_{-\infty}^{t} e^{\chi(u - t)} \left( 1 + 2i\hat\varphi(u) \right) du \right) \\ &= \hat\varphi(t) + \arg\left( 1 + \left( 1 - 2i\hat\varphi(t) \right) \left( 1 + 2i\chi \int_{-\infty}^{t} e^{\chi(u - t)} \hat\varphi(u)\, du \right) \right) \\ &\approx \hat\varphi(t) + \arg\left( 2 - 2i\hat\varphi(t) + 2i\chi \int_{-\infty}^{t} e^{\chi(u - t)} \hat\varphi(u)\, du \right) \\ &= \hat\varphi(t) + \arg\left( 1 - i\hat\varphi(t) + i\chi \int_{-\infty}^{t} e^{\chi(u - t)} \hat\varphi(u)\, du \right) \\ &\approx \hat\varphi(t) - \hat\varphi(t) + \chi \int_{-\infty}^{t} e^{\chi(u - t)} \hat\varphi(u)\, du \\ &= \chi \int_{-\infty}^{t} e^{\chi(u - t)} \hat\varphi(u)\, du. \end{aligned} \tag{7.55} $$

This demonstrates that the mark II phase estimate is approximately a weighted average of the
intermediate phase estimates, just as in the standard case it is approximately a normal average.
Note also the similarity of this result to the result for the linear case (7.26). Unfortunately the
simple technique used in the linear case cannot be applied here. The problem here is that the phase
estimates $\hat\varphi(t)$ are based on the previous data, not just on the data from the infinitesimal time
interval $[t, t + dt)$. This means that $\hat\varphi(t)$ is not independent of $\Theta(t)$, and we therefore cannot use the
simple techniques based on weighted averages.



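Using the result (7.51) for the intermediate phase estimate gives

$$ \Theta(t) \approx \chi\sqrt{2\chi} \int_{-\infty}^{t} e^{\chi(v - t)} \left\{ \int_{-\infty}^{v} e^{2|\alpha|\sqrt{2\chi}(u - v)} \left[ dW(u) - 2|\alpha|\kappa \int_u^t dW'(s)\, du \right] \right\} dv. \tag{7.56} $$

Note that the integral for the system phase variation is taken up to time $t$, rather than the time
of the intermediate phase estimate. This is because the phase is measured relative to the current
system phase, rather than the system phase at the time of that intermediate phase estimate.
In order to determine the variance, we need to evaluate

$$ \begin{aligned} \left\langle \Theta^2(t) \right\rangle &= 2\chi^3 \left\langle \int_{-\infty}^{t} dv_1 \int_{-\infty}^{t} dv_2\, e^{\chi(v_1 + v_2 - 2t)} \int_{-\infty}^{v_1} dW(u_1) \int_{-\infty}^{v_2} dW(u_2)\, e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - v_1 - v_2)} \right\rangle \\ &\quad + 8\chi^3|\alpha|^2\kappa^2 \left\langle \int_{-\infty}^{t} dv_1 \int_{-\infty}^{t} dv_2\, e^{\chi(v_1 + v_2 - 2t)} \int_{-\infty}^{v_1} du_1 \int_{-\infty}^{v_2} du_2\, e^{2|\alpha|\sqrt{2\chi}(u_1 + u_2 - v_1 - v_2)} \int_{u_1}^{t} dW'(s_1) \int_{u_2}^{t} dW'(s_2) \right\rangle. \end{aligned} \tag{7.57} $$

This is a lengthy calculation, and is performed in Appendix A.4. The result found is

$$ \left\langle \Theta^2(t) \right\rangle = \frac{\chi}{8|\alpha|^2} + \frac{\kappa^2}{2\chi}. \tag{7.58} $$

This is exactly the same result as for the simplified linear case (7.29), and the minimum phase
variance is

$$ \left\langle \Theta^2(t) \right\rangle_{\min} = \frac{\kappa}{2|\alpha|} \tag{7.59} $$

for

$$ \chi = 2|\alpha|\kappa. \tag{7.60} $$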



7.2 Continuous Heterodyne Measurements 

In order to determine how much of an improvement feedback gives for continuous measurements, I
will compare it with the case of continuous heterodyne measurements. For heterodyne measurements
on a pulsed coherent state, the introduced phase variance is equal to the intrinsic phase variance.
This indicates that the first term in Eq. (7.58) should be double for the heterodyne case, so the
phase variance is

$$ \left\langle \Theta^2(t) \right\rangle = \frac{\chi}{4|\alpha|^2} + \frac{\kappa^2}{2\chi}. \tag{7.61} $$

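This can be shown more rigorously using a similar technique to that used in |3j| . Expanding $A_t$
gives

$$ \begin{aligned} A_t &= \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\Phi(u)} I(u)\, du \\ &= \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\Phi(u)} \left[ \left( \alpha e^{-i\Phi} + \alpha^* e^{i\Phi} \right) du + dW(u) \right] \\ &= |\alpha| \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\varphi(u)}\, du + |\alpha| \int_{-\infty}^{t} e^{\chi(u - t)} e^{2i\Phi(u) - i\varphi(u)}\, du + \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\Phi(u)}\, dW(u). \end{aligned} \tag{7.62} $$

For the heterodyne case, the local oscillator phase $\Phi(t)$ varies very rapidly, so the second term above
will be very small. This means that $A_t$ simplifies to

$$ A_t = |\alpha| \int_{-\infty}^{t} e^{\chi(u - t)} e^{i\varphi(u)}\, du + i\sigma_t. \tag{7.63} $$

Since $B_t$ is negligible, the phase estimate $\Theta(t)$ simplifies to

$$ \Theta(t) = \arg A_t. \tag{7.64} $$
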
As above, the phase will be measured relative to the current system phase. In the limit of small
phase variance, the system phase does not vary significantly during the time $1/\chi$, so we can take
the linear approximation, giving

\[
\begin{aligned}
A_t &\approx |\alpha| \int_{-\infty}^{t} e^{\chi(u-t)} \left( 1 + i\varphi(u) \right) du + i\sigma_t \\
&= \frac{|\alpha|}{\chi} + i|\alpha| \int_{-\infty}^{t} e^{\chi(u-t)} \varphi(u)\,du + i\sigma_t.
\end{aligned} \tag{7.65}
\]



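Using this the phase estimate is

\[
\begin{aligned}
\Theta(t) &= \mathrm{Im}\left[ \log\left( \frac{|\alpha|}{\chi} + i|\alpha| \int_{-\infty}^{t} e^{\chi(u-t)} \varphi(u)\,du + i\sigma_t \right) \right] \\
&= \mathrm{Im}\left[ \log\left( 1 + i\chi \int_{-\infty}^{t} e^{\chi(u-t)} \varphi(u)\,du + \frac{i\chi\sigma_t}{|\alpha|} \right) \right] \\
&\approx \mathrm{Im}\left[ i\chi \int_{-\infty}^{t} e^{\chi(u-t)} \varphi(u)\,du + \frac{i\chi\sigma_t}{|\alpha|} \right].
\end{aligned} \tag{7.66}
\]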



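In the last line the linear approximation has again been used. Further evaluating this gives

\[
\begin{aligned}
\Theta(t) &= \chi \int_{-\infty}^{t} e^{\chi(u-t)} \varphi(u)\,du + \frac{\chi}{2|\alpha|} \left( \sigma_t + \sigma_t^* \right) \\
&= -\kappa\chi \int_{-\infty}^{t} du\, e^{\chi(u-t)} \int_{u}^{t} dW'(s) + \frac{\chi}{2|\alpha|} \left( \sigma_t + \sigma_t^* \right).
\end{aligned} \tag{7.67}
\]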



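The variance is therefore

\[ \langle \Theta^2(t) \rangle = \kappa^2\chi^2 \left\langle \int_{-\infty}^{t} du_1 \int_{-\infty}^{t} du_2\, e^{\chi(u_1+u_2-2t)} \int_{u_1}^{t} dW'(s_1) \int_{u_2}^{t} dW'(s_2) \right\rangle + \frac{\chi^2}{4|\alpha|^2} \left\langle \left( \sigma_t + \sigma_t^* \right)^2 \right\rangle. \tag{7.68} \]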

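The first term here can be evaluated to give

\[
\begin{aligned}
&\left\langle \int_{-\infty}^{t} du_1 \int_{-\infty}^{t} du_2\, e^{\chi(u_1+u_2-2t)} \int_{u_1}^{t} dW'(s_1) \int_{u_2}^{t} dW'(s_2) \right\rangle \\
&\quad= \int_{-\infty}^{t} ds_1 \int_{-\infty}^{t} ds_2\, e^{\chi(s_1+s_2-2t)} \int_{\max(s_1,s_2)}^{t} du \\
&\quad= \int_{-\infty}^{t} ds_1 \int_{-\infty}^{s_1} ds_2\, e^{\chi(s_1+s_2-2t)} \int_{s_1}^{t} du + \int_{-\infty}^{t} ds_1 \int_{s_1}^{t} ds_2\, e^{\chi(s_1+s_2-2t)} \int_{s_2}^{t} du \\
&\quad= \int_{-\infty}^{t} ds_1 \int_{-\infty}^{s_1} ds_2\, e^{\chi(s_1+s_2-2t)} (t - s_1) + \int_{-\infty}^{t} ds_1 \int_{s_1}^{t} ds_2\, e^{\chi(s_1+s_2-2t)} (t - s_2) \\
&\quad= 2 \int_{-\infty}^{t} ds_1 \int_{-\infty}^{s_1} ds_2\, e^{\chi(s_1+s_2-2t)} (t - s_1) \\
&\quad= \frac{2}{\chi} \int_{-\infty}^{t} ds_1\, e^{2\chi(s_1-t)} (t - s_1) \\
&\quad= \frac{1}{2\chi^3}.
\end{aligned} \tag{7.69}
\]

In addition, it is easy to show that

\[ \langle \sigma_t^2 \rangle = -\int_{-\infty}^{t} e^{2\chi(u-t)} e^{2i\Phi(u)}\,du. \tag{7.70} \]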



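Note that $\langle \sigma_t^2 \rangle \neq B_t$, so these variables are not completely analogous to those defined in the single-shot
case. Nevertheless, as $\Phi$ is rotating rapidly in the heterodyne case, we should still find that
$\langle \sigma_t^2 \rangle \approx 0$. Similarly, evaluating $\langle |\sigma_t|^2 \rangle$ gives

\[ \langle |\sigma_t|^2 \rangle = \left\langle \int_{-\infty}^{t} \int_{-\infty}^{t} e^{\chi(u_1+u_2-2t)} e^{i(\Phi(u_1)-\Phi(u_2))}\, dW(u_1)\, dW(u_2) \right\rangle = \frac{1}{2\chi}. \tag{7.71} \]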

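Using these results, as well as Eq. (7.69), the variance is

\[ \langle \Theta^2(t) \rangle = \frac{\kappa^2}{2\chi} + \frac{\chi}{4|\alpha|^2}. \tag{7.72} \]

This shows that Eq. (7.61) is correct. Taking the derivative of Eq. (7.61) gives

\[ \frac{\partial}{\partial\chi} \langle \Theta^2(t) \rangle = \frac{1}{4|\alpha|^2} - \frac{\kappa^2}{2\chi^2}. \tag{7.73} \]

Therefore the variance is minimised by

\[ \chi = \sqrt{2}\,\kappa|\alpha|, \tag{7.74} \]

and the minimum variance is

\[ \langle \Theta^2(t) \rangle_{\min} = \frac{\kappa}{\sqrt{2}\,|\alpha|}. \tag{7.75} \]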



The minimum phase variance is therefore $\sqrt{2}$ times the minimum phase variance for the adaptive
case.
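Two steps of the heterodyne argument lend themselves to a quick numerical check: the value $1/(2\chi^3)$ of the integral in Eq. (7.69), and the factor of $\sqrt{2}$ between the heterodyne and adaptive minima. The sketch below is illustrative only. The quadrature rule, the golden-section search and all function names are my own choices, and the heterodyne variance is assumed to be $\chi/(4|\alpha|^2) + \kappa^2/(2\chi)$.

```python
import math


def lag_integral(chi, n=200000, span=40.0):
    """Midpoint-rule estimate of (2/chi) * int_0^inf exp(-2 chi tau) tau dtau,
    where tau = t - s1, truncated at tau = span/chi."""
    h = span / chi / n
    total = 0.0
    for i in range(n):
        tau = (i + 0.5) * h
        total += math.exp(-2.0 * chi * tau) * tau
    return 2.0 / chi * total * h


def heterodyne_variance(chi, alpha_abs, kappa):
    """Assumed heterodyne variance: chi/(4|alpha|^2) + kappa^2/(2 chi)."""
    return chi / (4.0 * alpha_abs**2) + kappa**2 / (2.0 * chi)


def golden_section_min(f, lo, hi, iters=200):
    """Locate the minimiser of a unimodal function f on [lo, hi]."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    for _ in range(iters):
        c = b - g * (b - a)
        d = a + g * (b - a)
        if f(c) < f(d):
            b = d
        else:
            a = c
    return 0.5 * (a + b)


if __name__ == "__main__":
    alpha_abs, kappa = 10.0, 0.1
    chi_het = golden_section_min(
        lambda c: heterodyne_variance(c, alpha_abs, kappa), 1e-3, 1e3)
    adaptive_min = kappa / (2.0 * alpha_abs)
    print(lag_integral(0.5), 1.0 / (2.0 * 0.5**3))
    print(chi_het, heterodyne_variance(chi_het, alpha_abs, kappa) / adaptive_min)
```

The first printed pair compares the quadrature with $1/(2\chi^3)$, and the second line shows the numerically located optimum $\chi$ together with the ratio of the two minimum variances.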



7.3 Results for Continuous Dyne Measurements 

In order to verify these approximate analytic results, the equilibrium phase variance was determined
numerically for a variety of parameters. Although it at first appears that there are three parameters
that should be varied, $|\alpha|$, $\kappa$ and $\chi$, these parameters are not completely independent, and we need
only consider variation in two parameters. To see this, consider the equations for this system:

\[
\begin{aligned}
I(t)\,dt &= 2|\alpha| \sin\left( \varphi(t) - \hat\varphi(t) \right) dt + dW(t) \\
\varphi(t + dt) &= \varphi(t) + \kappa\,dW'(t) \\
A_t &= \int_{-\infty}^{t} e^{\chi(u-t)} e^{i\Phi(u)} I(u)\,du \\
B_t &= -\int_{-\infty}^{t} e^{\chi(u-t)} e^{2i\Phi(u)}\,du \\
\Theta(t) &= \arg\left( A_t + \chi B_t A_t^* \right).
\end{aligned} \tag{7.76}
\]
Consider a change in the time variable

\[ t' = t/\lambda^2. \tag{7.77} \]

For this change in the time variable, the variables $I(t)$, $A_t$ and $B_t$ should be scaled to

\[
\begin{aligned}
I'(t') &= \lambda I(t) \\
A'_{t'} &= A_t/\lambda \\
B'_{t'} &= B_t/\lambda^2.
\end{aligned} \tag{7.78}
\]

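This can be done because these are merely intermediate variables. With these substitutions the
equations become

\[
\begin{aligned}
I'(t')\,dt' &= 2\lambda|\alpha| \sin\left( \varphi(t') - \hat\varphi(t') \right) dt' + dW(t') \\
\varphi(t' + dt') &= \varphi(t') + \lambda\kappa\,dW'(t') \\
A'_{t'} &= \int_{-\infty}^{t'} e^{\lambda^2\chi(u-t')} e^{i\Phi(u)} I'(u)\,du \\
B'_{t'} &= -\int_{-\infty}^{t'} e^{\lambda^2\chi(u-t')} e^{2i\Phi(u)}\,du \\
\Theta(t') &= \arg\left( A'_{t'} + \lambda^2\chi B'_{t'} A'^*_{t'} \right).
\end{aligned} \tag{7.79}
\]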
This time scaling is therefore equivalent to making the substitutions

\[
\begin{aligned}
|\alpha| &\to \lambda|\alpha| \\
\kappa &\to \lambda\kappa \\
\chi &\to \lambda^2\chi.
\end{aligned} \tag{7.80}
\]

If $\lambda = 1/|\alpha|$, then $|\alpha|$ is replaced in the equations with 1, $\kappa$ is replaced with $\kappa/|\alpha|$, and $\chi$ is replaced
with $\chi/|\alpha|^2$. This means that the results depend only on the ratios $\kappa/|\alpha|$ and $\chi/|\alpha|^2$. If the
individual values of $\kappa$, $|\alpha|$ and $\chi$ are varied while keeping these ratios the same, then the results will
not change. It is therefore convenient to define the variables

\[ K = \frac{\kappa}{|\alpha|} \tag{7.81} \]

\[ X = \frac{\chi}{|\alpha|^2}. \tag{7.82} \]

Note that these variables are equal to the values of $\kappa$ and $\chi$ if $|\alpha|$ is equal to 1. In terms of these
variables, the minimum phase variance for adaptive measurements is $K/2$, and the optimum value
of $X$ is $2K$.
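The scaling argument can be illustrated with a few lines of code. The sketch below uses hypothetical helper names of my own. It checks that the substitutions of Eq. (7.80) leave $K$ and $X$ unchanged, and that the approximate adaptive variance can be written purely in terms of these two variables as $X/8 + K^2/(2X)$.

```python
def scaled_parameters(alpha_abs, kappa, chi):
    """Return (K, X) with K = kappa/|alpha| and X = chi/|alpha|^2."""
    return kappa / alpha_abs, chi / alpha_abs**2


def rescale(alpha_abs, kappa, chi, lam):
    """Apply |alpha| -> lam|alpha|, kappa -> lam kappa, chi -> lam^2 chi."""
    return lam * alpha_abs, lam * kappa, lam**2 * chi


def adaptive_variance_scaled(K, X):
    """Approximate adaptive variance in scaled variables: X/8 + K^2/(2X)."""
    return X / 8.0 + K**2 / (2.0 * X)


if __name__ == "__main__":
    original = (3.0, 0.2, 0.7)                 # |alpha|, kappa, chi (arbitrary values)
    print(scaled_parameters(*original))
    print(scaled_parameters(*rescale(*original, lam=0.37)))
```

Both printed pairs should agree up to rounding, which is why only $K$ and $X$ need to be varied in the numerical work.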

Figure 7.1: The phase variance for continuous adaptive measurements for $X = 2K$. The numerical
results are shown as crosses and the approximate analytic relation $K/2$ is shown as the continuous
line.



The value of $K$ was varied from 1 down to $2 \times 10^{-19}$. For each value of $K$, $X$ was varied from a
quarter to four times its optimum value of $2K$. The time steps used were

\[ \Delta t = \frac{1}{10^3 X}. \tag{7.83} \]



For these calculations 1024 simultaneous integrations were performed and the variance was sampled 
repeatedly. The integrations were taken up to time 10/X, in order for the variance to reach its 
equilibrium value, then the variance was sampled at time intervals of 1/X up until time 100/X. 
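The numerical procedure described above can be sketched as a simple Euler integration of Eqs. (7.76) in scaled units ($|\alpha| = 1$, $\kappa = K$, $\chi = X$). This is a rough illustration rather than the code used for the results in this chapter. It assumes the feedback estimate $\hat\varphi = \arg A_t$ with $\Phi = \hat\varphi + \pi/2$, uses the standard (not Holevo) variance, and takes a coarser time step and a much smaller ensemble than quoted in the text so that it runs in a few seconds. The function names and default arguments are my own.

```python
import cmath
import math
import random


def wrap(x):
    """Wrap an angle to the interval (-pi, pi]."""
    return math.atan2(math.sin(x), math.cos(x))


def simulate_adaptive(K, X=None, n_traj=100, steps_per_unit=200,
                      t_burn=10.0, t_end=40.0, seed=1):
    """Mean-square error of Theta(t) in scaled units; times are in units of 1/X."""
    if X is None:
        X = 2.0 * K                       # analytic optimum for adaptive measurements
    rng = random.Random(seed)
    dt = 1.0 / (steps_per_unit * X)
    sqdt = math.sqrt(dt)
    n_burn = int(round(t_burn * steps_per_unit))
    n_end = int(round(t_end * steps_per_unit))
    decay = 1.0 - X * dt                  # first-order form of exp(-X dt)
    total, count = 0.0, 0
    for _ in range(n_traj):
        phi = 0.0                         # system phase
        A = 0j
        B = 0j
        for n in range(1, n_end + 1):
            phi_hat = cmath.phase(A)      # assumed feedback estimate (0 when A = 0)
            e_iPhi = 1j * cmath.exp(1j * phi_hat)   # exp(i Phi), Phi = phi_hat + pi/2
            I_dt = 2.0 * math.sin(phi - phi_hat) * dt + sqdt * rng.gauss(0.0, 1.0)
            A = decay * A + e_iPhi * I_dt
            B = decay * B - e_iPhi * e_iPhi * dt
            phi += K * sqdt * rng.gauss(0.0, 1.0)   # phase diffusion
            if n > n_burn and n % steps_per_unit == 0:
                theta = cmath.phase(A + X * B * A.conjugate())
                err = wrap(theta - phi)
                total += err * err
                count += 1
    return total / count


if __name__ == "__main__":
    K = 0.01
    print(simulate_adaptive(K), K / 2.0)
```

For small $K$ the printed estimate should be of the same order as the analytic value $K/2$. Reproducing the accuracy of the figures would need the smaller time step and the larger ensemble described in the text.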

The results for $X = 2K$ are plotted in Fig. 7.1. The variances for $K = 1$ to $5 \times 10^{-7}$ are the
Holevo variances, and for $K$ below $5 \times 10^{-7}$ are the standard variances. As can be seen, the results are
very close to the analytic expression. To show the improvement over heterodyne measurements, the
ratio of the minimum phase variance for adaptive measurements to the minimum phase variance for
heterodyne measurements (with $X = \sqrt{2}K$) is plotted in Fig. 7.2. The ratio is close to 1 for large $K$,
but for smaller $K$ the ratio gets closer and closer to $1/\sqrt{2}$.

In order to see the differences from the analytic expression more clearly, the ratios of the variances
to the analytic values for adaptive and heterodyne measurements are plotted in Fig. 7.3. As can
be seen the phase variances do differ significantly from the analytic values for large $K$, but the
agreement is extremely good for small $K$. This can be expected, because the approximations made
are for the limit of large $|\alpha|$, which is equivalent to small $K$. Also the agreement at large $K$ is slightly
better for heterodyne measurements than for adaptive measurements.



Alternatively we can plot the phase variance as a function of $X$ for fixed $K$. In Fig. 7.4 I have shown
the phase variance as a function of $X$ for $K = 0.001$ for adaptive and heterodyne measurements. The
numerical results agree reasonably closely with the analytic values, although there is a noticeable
difference for adaptive measurements for the larger values of $X$. Note that the minimum phase
variance for adaptive measurements is at $X = 2K$, and the minimum phase variance for heterodyne
measurements is larger and at a smaller value of $X$. When the value of $K$ is reduced further, as in
Fig. 7.5, the numerical results agree even more closely with the analytic values.

Figure 7.2: The ratio of the minimum phase variance for continuous adaptive measurements to the
minimum phase variance for continuous heterodyne phase measurements.

Figure 7.3: The ratio between the numerically obtained phase variance and the analytic expression
$K/2$ for adaptive measurements (continuous line), and $K/\sqrt{2}$ for heterodyne measurements (dotted
line).

Figure 7.4: The phase variance as a function of $X$ for $K = 0.001$. The numerical results for adaptive
and heterodyne measurements are shown as the crosses and pluses respectively and the approximate
analytic results for adaptive and heterodyne measurements are shown as the continuous line and
dotted line respectively.

Figure 7.5: The phase variance as a function of $X$ for $K = 10^{-9}$. The numerical results for adaptive
and heterodyne measurements are shown as the crosses and pluses respectively and the approximate
analytic results for adaptive and heterodyne measurements are shown as the continuous line and
dotted line respectively.

7.4 Continuous Squeezed State Measurements

It is also possible to consider dyne measurements on continuous squeezed states. At first it appears 
that it does not make sense to consider squeezed states in the continuous case. This is because, in 
the single-shot case, the reduced variance is due to the back-action of the measurement on the state. 
The photocurrent is given by 

\[ I(t)\,dt = 2\,\mathrm{Re}\left( \langle a \rangle e^{-i\Phi(t)} \right) dt + dW(t). \tag{7.84} \]

Here there is no factor multiplying $dW$, so the variances in the individual values of $I(t)$ are not
reduced for a squeezed state. Instead, the reduced phase variance comes from the variation of $\langle a \rangle$
during the measurement. If we are considering continuous measurements, then the state should
remain constant during the measurement. This means that $\langle a \rangle$ is constant, and the phase variance
cannot be reduced by this factor.

For the state to remain constant, we must consider a squeezed state produced by a driven
parametric oscillator in the limit that the decay time of the cavity is extremely short. This limit
must be taken before the limit $\delta t \to 0$ in the definition for the photocurrent. In this limit, the
changes in the state over the time scale of the decay constant result in reduced noise, but these
changes do not persist over the time interval $\delta t$, which is much larger than the decay time.

The net result of this is that when we take the limit $\delta t \to 0$ the photocurrent is given by

\[ I(t)\,dt = 2\,\mathrm{Re}\left( \alpha e^{-i\Phi} \right) dt + \sqrt{ e^{-2r} \cos^2(\Phi - \phi_\zeta/2) + e^{2r} \sin^2(\Phi - \phi_\zeta/2) }\; dW(t), \tag{7.85} \]

where $\alpha$ is the amplitude of the squeezed state, and $r$ and $\phi_\zeta$ are the magnitude and direction of
the squeezing. Therefore in this case, rather than the uncertainty being reduced by variation in $\langle a \rangle$,
it is reduced by a multiplying factor for $dW$.

For reduced phase uncertainty, the phase of the squeezing should be $\phi_\zeta = 2\varphi + \pi$, where $\varphi$ is the
system phase. If we are using feedback given by

\[ \Phi = \hat\varphi + \pi/2, \tag{7.86} \]

where $\hat\varphi$ is an estimate of the phase, then the photocurrent can be expressed as

\[ I(t)\,dt = 2|\alpha| \sin(\varphi - \hat\varphi)\,dt + \sqrt{ e^{-2r} \cos^2(\hat\varphi - \varphi) + e^{2r} \sin^2(\hat\varphi - \varphi) }\; dW(t). \tag{7.87} \]

It is clear that if the intermediate phase estimate used is very close to the system phase, then the
factor multiplying $dW$ will be close to $e^{-r}$ and will be at a minimum. The better the intermediate
phase estimate is, the smaller this multiplying factor will be. If the intermediate phase estimate is not
perfect, it is clear that increasing the squeezing past a certain level will not reduce the multiplying
factor. This is because the $e^{2r}$ term will start to dominate.

It is possible to estimate the optimum squeezing and the minimum phase variance under these
measurements using the linear approximation. In this approximation, the variance in the individual
phase estimates $\theta(t)$ is equal to

\[ \frac{ e^{-2r} \cos^2(\hat\varphi - \varphi) + e^{2r} \sin^2(\hat\varphi - \varphi) }{ 4|\alpha|^2\,dt }. \tag{7.88} \]

It is clear that the minimum phase variance (in this approximation) will be obtained when the best
phase estimates are used for $\hat\varphi$. It is therefore reasonable to use the phase estimates $\Theta(t)$ for $\hat\varphi$.
These will be the best phase estimates when the correct value of $\chi$ is used. As the variance of these
estimates is $\Delta\Theta^2$, we obtain

\[ \left\langle e^{-2r} \cos^2(\hat\varphi - \varphi) + e^{2r} \sin^2(\hat\varphi - \varphi) \right\rangle \approx e^{-2r} + e^{2r} \Delta\Theta^2. \tag{7.89} \]

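This approximation will be true for small phase variances and large squeezing. Following the same
derivation as for the coherent state case, the only difference is the multiplying factor, so we obtain

\[ \Delta\Theta^2 = \frac{\chi}{8|\alpha|^2} \left( e^{-2r} + e^{2r} \Delta\Theta^2 \right) + \frac{\kappa^2}{2\chi}. \tag{7.90} \]

Solving this for $\Delta\Theta^2$ gives

\[ \Delta\Theta^2 = \frac{ \dfrac{\chi e^{-2r}}{8|\alpha|^2} + \dfrac{\kappa^2}{2\chi} }{ 1 - \dfrac{\chi e^{2r}}{8|\alpha|^2} }. \tag{7.91} \]

This expression has two independent variables, $\chi$ and $r$, that can be varied in order to find the
minimum phase variance. Taking the derivative of Eq. (7.90) with respect to $\chi$ gives

\[ \frac{\partial \Delta\Theta^2}{\partial\chi} = \frac{1}{8|\alpha|^2} \left( e^{-2r} + e^{2r} \Delta\Theta^2 \right) + \frac{\chi e^{2r}}{8|\alpha|^2} \frac{\partial \Delta\Theta^2}{\partial\chi} - \frac{\kappa^2}{2\chi^2}. \tag{7.92} \]

Since the derivative is zero for the minimum this gives

\[ \frac{1}{8|\alpha|^2} \left( e^{-2r} + e^{2r} \Delta\Theta^2 \right) = \frac{\kappa^2}{2\chi^2}. \tag{7.93} \]

Together with Eq. (7.90), this gives

\[ \Delta\Theta^2 = \frac{\kappa^2}{\chi}. \tag{7.94} \]

Substituting this into Eq. (7.90) gives

\[ \Delta\Theta^2 = \frac{\kappa^2}{4|\alpha|^2} \left( \frac{e^{-2r}}{\Delta\Theta^2} + e^{2r} \right). \tag{7.95} \]

Taking the derivative of this with respect to $r$ gives

\[ \frac{\partial \Delta\Theta^2}{\partial r} = \frac{\kappa^2}{2|\alpha|^2} \left( e^{2r} - \frac{e^{-2r}}{\Delta\Theta^2} \right) - \frac{\kappa^2}{4|\alpha|^2} \frac{e^{-2r}}{(\Delta\Theta^2)^2} \frac{\partial \Delta\Theta^2}{\partial r}. \tag{7.96} \]

Since the derivative is zero at the minimum, this becomes

\[ 0 = \frac{\kappa^2}{2|\alpha|^2} \left( e^{2r} - \frac{e^{-2r}}{\Delta\Theta^2} \right). \tag{7.97} \]

This can be solved to give

\[ e^{-4r} = \Delta\Theta^2. \tag{7.98} \]

Substituting this back into Eq. (7.95) gives the phase variance as

\[ \Delta\Theta^2 = \left( \frac{\kappa}{\sqrt{2}\,|\alpha|} \right)^{4/3}. \tag{7.99} \]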



Thus we see that even for an arbitrarily squeezed state, the best scaling we can obtain for the
phase variance is $|\alpha|^{-4/3}$, as compared to $|\alpha|^{-1}$ for a coherent state. This difference is less than for
single-shot measurements, where optimum squeezed states scale as almost $\bar{n}^{-2}$, as compared to $\bar{n}^{-1}$
for coherent states.
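The stationary point found above can be cross-checked by minimising the linearised variance directly over $\chi$ and $r$. The sketch below is an illustration under the assumption that Eq. (7.91) has the form $[\chi e^{-2r}/(8|\alpha|^2) + \kappa^2/(2\chi)] / [1 - \chi e^{2r}/(8|\alpha|^2)]$. The grid ranges and function names are arbitrary choices of mine.

```python
import math


def squeezed_variance(chi, r, alpha_abs, kappa):
    """Linearised variance for squeezed states; None where the denominator
    is not positive, so that no physical solution exists."""
    a2 = alpha_abs**2
    den = 1.0 - chi * math.exp(2.0 * r) / (8.0 * a2)
    if den <= 0.0:
        return None
    num = chi * math.exp(-2.0 * r) / (8.0 * a2) + kappa**2 / (2.0 * chi)
    return num / den


def analytic_optimum(alpha_abs, kappa):
    """Variance, squeezing and chi at the analytic stationary point."""
    var = (kappa / (math.sqrt(2.0) * alpha_abs)) ** (4.0 / 3.0)
    r = -0.25 * math.log(var)          # from exp(-4r) = variance
    chi = kappa**2 / var               # from variance = kappa^2 / chi
    return var, r, chi


def grid_minimum(alpha_abs, kappa, n=400):
    """Brute-force minimum over chi in [0.1, 10] chi_opt and r in [0, 2 r_opt]."""
    _, r0, chi0 = analytic_optimum(alpha_abs, kappa)
    best = None
    for i in range(n + 1):
        chi = chi0 * 10.0 ** (-1.0 + 2.0 * i / n)
        for j in range(n + 1):
            r = 2.0 * r0 * j / n
            v = squeezed_variance(chi, r, alpha_abs, kappa)
            if v is not None and (best is None or v < best[0]):
                best = (v, r, chi)
    return best


if __name__ == "__main__":
    print(analytic_optimum(1.0, 0.01))
    print(grid_minimum(1.0, 0.01))
```

If the derivation is correct, the grid minimum should agree with the analytic variance, squeezing and $\chi$ to within the grid resolution.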



7.5 Results for Squeezed States 

The results for the continuous squeezed state case were obtained by a similar method as for the
coherent state case. Similarly to the case for coherent states, only variation in the variables $K$ and
$X$ was considered, rather than in all three variables $|\alpha|$, $\kappa$ and $\chi$. The step sizes used were

\[ \Delta t = \frac{1}{10^3 X}. \tag{7.100} \]

The integrations were taken up to time $30/X$, then the variance was sampled every time step until
time $130/X$. The integration was performed using the photocurrent given in Eq. (7.85), except with
the time scaled such that $|\alpha|$ was not required (similarly to the coherent state case). The phase of
the squeezing was assumed at all times to be in the correct direction for reduced phase variance, i.e.
negative with respect to the coherent amplitude.

Figure 7.6: The phase variance as a function of $K$ for continuous squeezed states. The continuous
line is the theoretical analytic relation, and the crosses are the numerical results.

It was found that when the phase estimate $\arg C_t$ was used in the feedback, very poor results
were obtained. This is a similar result to the case for single-shot measurements, where using $\arg C_v$
feedback results in large phase variances. This is because, when the intermediate phase estimates
are extremely good, the results do not distinguish easily between the real system phase and the
system phase plus $\pi$. This means that many of the results are out by $\pi$, resulting in a large overall
phase variance.

In order to avoid this problem, rather than using $\arg C_t$ in the feedback, an intermediate phase
estimate given by

\[ \hat\varphi(t) = \arg\left( C_t^{1-\varepsilon} A_t^{\varepsilon} \right), \tag{7.101} \]

was used. Note that this is similar to the phase estimate used to obtain phase measurements close
to optimum in the single-shot case. Here a constant $\varepsilon$ was used, as the system state is not changing
like in the single-shot case.

For each value of $K$ there are three variables that can be altered to minimise the phase variance:
$X$, $r$ and $\varepsilon$. It is not computationally feasible to consider a range of values for all three variables.
Rather than considering a range of values, the values of the variables were varied in order to find
the values that gave the minimum phase variance.

The minimum phase variances obtained by this method are plotted as a function of $K$ in Fig. 7.6.
The values given by the approximate analytic expression (7.99) are also shown in this figure. The
numerical results are higher than the analytic expression, but for small $K$ they have the same scaling.
If we plot the ratio of the numerical results to the analytic values as in Fig. 7.7, we find that for the
smallest values of $K$ the ratio levels off at about 2.6. Thus we see that the stochastic results also
give a scaling of $|\alpha|^{-4/3}$.

Figure 7.7: The ratio of the numerically obtained phase variance to the analytic relation as a function
of $K$ for continuous squeezed states.

Now note that, from Eqs (7.98) and (7.99), the optimum value of $e^{-2r}$ should be

\[ e^{-2r} = \left( \frac{\kappa}{\sqrt{2}\,|\alpha|} \right)^{2/3}. \tag{7.102} \]

Similarly, from Eqs (7.94) and (7.99), the optimum value of $\chi$ should be

\[ \chi = \left( 2|\alpha|^2\kappa \right)^{2/3}. \tag{7.103} \]

The corresponding optimum value of $X$ is

\[ X = (2K)^{2/3}. \tag{7.104} \]

This indicates that the optimum values for both $e^{-2r}$ and $X$ should scale as $K^{2/3}$.

The numerically obtained optimum values of $e^{-2r}$ and $X$, as well as these analytic expressions,
are plotted in Fig. 7.8. Similarly to the case for the phase variance, the scaling is the same as that
predicted analytically, but the scaling constants are different. For the case of $e^{-2r}$, the optimum
values are about 8 times those analytically predicted, whereas the values of $X$ are around a third of
those analytically predicted.

For the case of $\varepsilon$ there is no analytic prediction for the optimum value. The numerically obtained
values are shown in Fig. 7.9, and as can be seen $\varepsilon$ decreases in a regular way with $K$. A power law
was fitted to these values (for $K < 1$), and the power found was $0.70 \pm 0.01$. This is very similar to
the $K^{2/3}$ scaling found for $e^{-2r}$ and $X$.

A problem with these results is that they do not take account of the low probability results with
large error. Unfortunately it is not possible to take account of these results in an analytic way as
in the case of single shot measurements on squeezed states. The best that can be done is to use
a very large number of samples. For these calculations the phase variance was sampled every time
step, resulting in about $10^8$ samples. Unfortunately the phase estimates from adjacent time steps
are strongly correlated, so there are only around $10^5$ independent samples.

Despite the large number of samples used, there were generally no large error samples for the
results shown. The optimum parameters found usually gave the minimum variance because varying
the parameters beyond these values, such that the variance would be smaller according to the linear
theory, resulted in samples with large error.

Figure 7.8: The optimum values of $e^{-2r}$ and $X$ for measurements on continuous squeezed states.
The numerically found values of $e^{-2r}$ are plotted as crosses, and the approximate analytic expression
as a continuous line. The numerically found values of $X$ are plotted as pluses, and the approximate
analytic expression as a dotted line.

Figure 7.9: The optimum values of $\varepsilon$ for measurements on continuous squeezed states. The crosses
are the numerically found values, and the continuous line is the expression fitted to the data.

7.6 Continuous Interferometric Measurements

Now I will consider the case of continuous measurements for two-mode interferometry. In this case 
we have a Mach-Zehnder interferometer as in Ch. 0, and are attempting to continuously track the 
phase in one arm, and control the phase in the other arm in order to obtain the best possible 
estimate of the phase. The optimum states discussed in Ch. || make little sense in this context, so 
instead I will consider the state with all the photons in one arm. This state can be generalised to 
the continuous case by considering a state with N photons per unit time. 

This case is essentially semiclassical, and the detections can be considered independently. There- 
fore, consider a single photon incident on port a, so the state is h) z . Upon detection the unnor- 
malised state changes as 

\yj(u,<p)) =c u (ipM. (7.105) 

Using Eq. (6.5) for the detection operator, the probability of detecting the photon in detector $u$ is
given by

\[ \langle \tilde\psi(u,\varphi) | \tilde\psi(u,\varphi) \rangle = \sin^2\left( \frac{\varphi - \Phi + u\pi}{2} \right). \tag{7.106} \]

Using Bayes' theorem, the probability distribution for the system phase after the detection is
proportional to this probability times the initial phase distribution.

The probability distribution for the phase based on $m$ of these independent detections, $P(\varphi|n_m)$,
can be expressed as

\[ P(\varphi|n_m) = \sum_{k=-m}^{m} P_{mk}(n_m)\, e^{ik\varphi}. \tag{7.107} \]

It is straightforward to show from Eq. (7.106) that the coefficients $P_{mk}(n_m)$ can be determined by

\[ P_{mk}(n_m) \propto P_{m-1,k}(n_{m-1}) - \tfrac{1}{2} e^{-i(\Phi_m - u_m\pi)} P_{m-1,k-1}(n_{m-1}) - \tfrac{1}{2} e^{i(\Phi_m - u_m\pi)} P_{m-1,k+1}(n_{m-1}). \tag{7.108} \]

The normalisation condition on the probability distribution becomes

\[ P_{m0}(n_m) = 1. \tag{7.109} \]

The normalised probability distribution can be obtained by a simple addition to the recursion
relation:

\[ P_{mk}(n_m) = \frac{ P_{m-1,k}(n_{m-1}) - \tfrac{1}{2} e^{-i(\Phi_m - u_m\pi)} P_{m-1,k-1}(n_{m-1}) - \tfrac{1}{2} e^{i(\Phi_m - u_m\pi)} P_{m-1,k+1}(n_{m-1}) }{ P_{m0}(n_m) }. \tag{7.110} \]

Here the $P_{m0}(n_m)$ in the denominator is the unnormalised coefficient given by the right-hand side of
Eq. (7.108) for $k = 0$.

I will consider the same variation in the system phase as in the case of dyne measurements,

\[ \varphi(t + dt) = \varphi(t) + \kappa\,dW(t). \tag{7.111} \]

When the phase varies in time, the time between detections is important. For a photon flux of $N$,
the probability of a photodetection in time $dt$ is $N\,dt$. The probability distribution for the time
between detections is given by

\[ P_p(t)\,dt = N e^{-Nt}\,dt. \tag{7.112} \]

In the results that will be presented here, the time between detections, $\Delta t$, was determined according
to this probability distribution.
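Drawing the waiting times from Eq. (7.112) is straightforward with a standard exponential sampler. The following sketch uses Python's `random.expovariate`; the function name and the sample size are illustrative only.

```python
import random


def detection_intervals(flux_N, n, seed=0):
    """Draw n waiting times from the density N exp(-N t)."""
    rng = random.Random(seed)
    return [rng.expovariate(flux_N) for _ in range(n)]


if __name__ == "__main__":
    samples = detection_intervals(flux_N=5.0, n=200000)
    print(sum(samples) / len(samples))   # should be close to 1/N = 0.2
```

The sample mean approaches $1/N$, the average time between detections.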

Now in order to determine the effect of this phase diffusion on the probability distribution between
detections, we must first consider the effect over some very small time interval $\delta t$. This is necessary
because the probability distribution for the change in the system phase over time $\Delta t$ does not go to
zero for $\Delta\varphi = \pm\pi$. This means that the probability distribution will not be exactly Gaussian, due
to the overlap. In contrast, if we look at a very small time interval $\delta t$, the change in the phase will
have a normal distribution with a variance of $\kappa^2\delta t$. Explicitly the probability distribution is

\[ P_G(\Delta\varphi)\,d(\Delta\varphi) = \frac{1}{\sqrt{2\pi\kappa^2\delta t}}\, e^{-(\Delta\varphi)^2/(2\kappa^2\delta t)}\, d(\Delta\varphi). \tag{7.113} \]

The probability distribution for the phase after time $\delta t$ will be the convolution of the initial
probability distribution with the Gaussian described by Eq. (7.113). Evaluating this convolution
gives

\[
\begin{aligned}
P'(\varphi|n_m) &= \int_{-\pi}^{\pi} P(\varphi - \theta|n_m)\, P_G(\theta)\, d\theta \\
&= \int_{-\pi}^{\pi} \sum_{k=-m}^{m} P_{mk}(n_m)\, e^{ik(\varphi - \theta)} P_G(\theta)\, d\theta \\
&= \sum_{k=-m}^{m} P_{mk}(n_m)\, e^{ik\varphi} \int_{-\pi}^{\pi} e^{-ik\theta} P_G(\theta)\, d\theta.
\end{aligned} \tag{7.114}
\]

As $\delta t$ is assumed to be small, $\kappa^2\delta t \ll 1$, so

\[ \int_{-\pi}^{\pi} e^{-ik\theta} P_G(\theta)\, d\theta = e^{-k^2\kappa^2\delta t/2}. \tag{7.115} \]



The effect of the variation of the system phase on the probability distribution is therefore

\[ P_{\delta t}(\varphi|n_m) = \sum_{k=-m}^{m} P_{mk}(n_m)\, e^{-k^2\kappa^2\delta t/2}\, e^{ik\varphi}. \tag{7.116} \]

This shows that the coefficients for the probability distribution are just multiplied by a Gaussian:

\[ P^{\delta t}_{mk}(n_m) = P_{mk}(n_m)\, e^{-k^2\kappa^2\delta t/2}. \tag{7.117} \]

This result is related to the usual result for convolutions and Fourier transforms.

In order to take account of the effect of the phase diffusion on the probability distribution over
some significant time interval $\Delta t$, this time interval can be divided into $M$ small time intervals $\delta t$.
Then we find

\[
\begin{aligned}
P^{\Delta t}_{mk}(n_m) &= P_{mk}(n_m) \prod_{j=1}^{M} e^{-k^2\kappa^2\delta t/2} \\
&= P_{mk}(n_m)\, e^{-k^2\kappa^2 \sum_{j=1}^{M} \delta t/2} \\
&= P_{mk}(n_m)\, e^{-k^2\kappa^2\Delta t/2}.
\end{aligned} \tag{7.118}
\]

This result can be used to take account of the variation of the system phase very simply in the
probability distribution.

As time passes the effect of Eq. (7.110) is to broaden the distribution of probability coefficients
in $k$, corresponding to a smaller variance in the phase distribution. In contrast the Gaussian term
in Eq. (7.118) tends to narrow the distribution of probability coefficients, corresponding to a greater
phase variance. The initially broad phase distribution narrows until an approximate equilibrium is
reached, where the two effects cancel each other out.
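The two competing updates, the Bayesian step for a detection and the Gaussian damping between detections, can be sketched on a truncated list of Fourier coefficients. This is only an illustration under stated assumptions. The coefficients are stored for $k = -k_{\max}, \ldots, k_{\max}$, anything beyond $k_{\max}$ is dropped, the estimate is taken as the argument of the $k = -1$ coefficient, and the function names are hypothetical.

```python
import cmath
import math


def flat_prior(kmax):
    """Coefficients of a uniform phase distribution, normalised so that P_0 = 1."""
    p = [0j] * (2 * kmax + 1)
    p[kmax] = 1.0 + 0j
    return p


def detection_update(p, Phi, u):
    """Bayesian update for a detection at output u with feedback phase Phi,
    followed by normalisation to P_0 = 1."""
    kmax = (len(p) - 1) // 2
    ph = cmath.exp(1j * (Phi - u * math.pi))
    new = [0j] * len(p)
    for i in range(len(p)):
        lower = p[i - 1] if i > 0 else 0j             # coefficient k - 1
        upper = p[i + 1] if i < len(p) - 1 else 0j    # coefficient k + 1
        new[i] = p[i] - 0.5 * ph.conjugate() * lower - 0.5 * ph * upper
    norm = new[kmax]                                  # unnormalised k = 0 coefficient
    return [c / norm for c in new]


def diffuse(p, kappa, dt):
    """Multiply coefficient k by exp(-k^2 kappa^2 dt / 2)."""
    kmax = (len(p) - 1) // 2
    return [c * math.exp(-0.5 * (i - kmax) ** 2 * kappa**2 * dt)
            for i, c in enumerate(p)]


def phase_estimate(p):
    """Argument of the k = -1 coefficient, proportional to <exp(i phi)>."""
    kmax = (len(p) - 1) // 2
    return cmath.phase(p[kmax - 1])


if __name__ == "__main__":
    p = detection_update(flat_prior(3), Phi=0.0, u=0)
    print(p[2:5], phase_estimate(p))
    print(diffuse(p, kappa=1.0, dt=0.5)[2:5])
```

After a single detection at $u = 0$ with $\Phi = 0$ the distribution is proportional to $\sin^2(\varphi/2)$, so the estimate lies at $\pm\pi$. The damping step leaves the $k = 0$ coefficient unchanged and shrinks the others.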

The derivation for the optimum phase estimate for the single-shot case given in Eq. (6.19) is
general enough to hold in this case also. Therefore the optimal phase estimate is given by

\[
\begin{aligned}
\hat\varphi &= \arg \langle e^{i\varphi} \rangle \\
&= \arg P_{m,-1}(n_m).
\end{aligned} \tag{7.119}
\]

In addition, much of the reasoning for the feedback phase still holds. The phase variance after the
next detection can be minimised by maximising the value of

\[ M(\Phi_m) = \sum_{u_m=0,1} \left| \int_{-\pi}^{\pi} P(n_m|\varphi)\, e^{i\varphi}\, d\varphi \right|. \tag{7.120} \]

The values of $P(n_m|\varphi)$ can be obtained, except for a normalising constant that is common to
$u_m = 0$ and 1, by using Eq. (7.108). This means that we can express $M(\Phi_m)$ as in Eq. (6.33) with
the parameters $a$, $b$ and $c$ given by

\[
\begin{aligned}
a &= P_{m-1,-1}(n_{m-1}) \\
b &= \tfrac{1}{2} P_{m-1,-2}(n_{m-1}) \\
c &= \tfrac{1}{2} P_{m-1,0}(n_{m-1}).
\end{aligned} \tag{7.121}
\]

These values of $a$, $b$ and $c$ can be used to determine the feedback phase as in Sec. 6.3.



The phase uncertainty at equilibrium can be estimated using a similar approach as was used
for the single mode case. Let us assume that the equilibrium variance in the best estimate for the
system phase is $\Delta\Theta^2$. After time $\Delta t$, the variance in this phase estimate with respect to the new
system phase, $\varphi(t + \Delta t)$, will be $\Delta\Theta^2 + \kappa^2\Delta t$. In the equilibrium case this increase in the variance
should, on average, be balanced by the decrease due to the next detection.

We now wish to estimate the equilibrium variance based on a weighted average with the previous 
best phase estimate, and a phase estimate from the new detection. If we use the actual variance for 
a phase estimate based on a single detection, then we do not get accurate results. This is because 
the variance for a single detection is large, so the weighted average does not accurately correspond 
to the exact theory. In order to make the theory based on weighted averages accurate, we need to 
assume an effective variance for the single detection, that is different from the actual variance. 

In the case where there is no variation in the system phase, the phase variance after $N$ detections
is approximately $1/N$ (see the results given in Secs 5.5 and 6.6). Denoting the variance after $N$
detections as $\Delta\Theta_N^2$, and the effective phase variance from a new detection as $\Delta\Theta_1^2$, the weighted
average gives

\[ \frac{1}{\Delta\Theta_{N+1}^2} = \frac{1}{\Delta\Theta_N^2} + \frac{1}{\Delta\Theta_1^2} = N + \frac{1}{\Delta\Theta_1^2} = N + 1. \tag{7.122} \]

Therefore, in order to take account of a single detection via a weighted average, it can be assumed
to be equivalent to a phase estimate with a variance of 1. This is, in fact, equal to the variance as
estimated using $\langle 2(1 - \cos\varphi) \rangle$.

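Applying this to the case with a varying system phase gives

\[ \frac{1}{\Delta\Theta^2} = \frac{1}{\Delta\Theta^2 + \kappa^2\Delta t} + 1. \tag{7.123} \]

Simplifying this to solve for $\Delta\Theta^2$ we find

\[ \Delta\Theta^2 = \frac{\kappa^2\Delta t/\Delta\Theta^2}{1 + \kappa^2\Delta t/\Delta\Theta^2} \approx \frac{\kappa^2\Delta t}{\Delta\Theta^2}. \tag{7.124} \]

Thus the approximate value of the variance is

\[ \Delta\Theta^2 \approx \kappa\sqrt{\Delta t}. \tag{7.125} \]

On average the time between detections is $1/N$, so the approximate value of the variance should be

\[ \Delta\Theta^2 \approx \kappa/\sqrt{N}. \tag{7.126} \]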
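The quality of the approximation leading to Eq. (7.125) can be checked by solving the balance condition $1/\Delta\Theta^2 = 1/(\Delta\Theta^2 + \kappa^2\Delta t) + 1$ exactly. Writing $V = \Delta\Theta^2$ and $s = \kappa^2\Delta t$, it is equivalent to the quadratic $V^2 + sV - s = 0$. The short sketch below (function name hypothetical) compares the positive root with $\kappa\sqrt{\Delta t}$.

```python
import math


def equilibrium_variance(kappa, dt):
    """Positive root of V^2 + s V - s = 0 with s = kappa^2 dt."""
    s = kappa**2 * dt
    return 0.5 * (-s + math.sqrt(s * s + 4.0 * s))


if __name__ == "__main__":
    for kappa in (1.0, 0.1, 0.01):
        dt = 1.0                                   # one mean waiting time for N = 1
        exact = equilibrium_variance(kappa, dt)
        print(kappa, exact, kappa * math.sqrt(dt))
```

The two columns converge as $\kappa^2\Delta t$ becomes small, and the exact root lies slightly below the approximation.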
7.7 Results for Continuous Interferometric Measurements

In order to verify this result, the equilibrium phase variance was determined numerically for a
variety of parameters. In this case there are only two parameters, $\kappa$ and $N$. In the case of dyne
measurements there was the additional parameter $\chi$ describing how the latest results were weighted
as compared to the previous results. In this case we do not have that parameter, as the phase
estimates are not determined in that way.

Similarly to the case of dyne measurements, these two parameters are related. Here the average
time between detections is $1/N$, and the mean squared change in the system phase after time $\Delta t$ is
$\kappa^2\Delta t$. In the above theory the parameters $\kappa$ and $\Delta t$ only appear through the product $\kappa^2\Delta t$. It is
therefore the value of $\kappa^2/N$ that matters, and if $\kappa$ and $N$ are varied while keeping this ratio the
same, the results will not change. It is therefore convenient to define

\[ K = \frac{\kappa}{\sqrt{N}}, \tag{7.127} \]

similarly to the case for continuous dyne measurements.

The calculations were run for $10^5$ detections (or $2 \times 10^5$ for the minimum value of $K$), and the
phase error was sampled every detection after $10/K$ detections. This was done 100 times for each
value of $K$. The equilibrium phase variance was determined in this way for the nearly optimum
feedback scheme and in addition for a nonadaptive feedback scheme given by

\[ \Phi_m = \Phi_0 + mK\pi, \tag{7.128} \]

where $\Phi_0$ is a random initial phase. When the value of $K$ was 1 or more this was modified to

\[ \Phi_m = \Phi_0 + m\pi/2, \tag{7.129} \]

to prevent $\Phi_m$ being constant (modulo $\pi$). This is equivalent to the non-adaptive feedback in the
single-shot case given by Eq. (6.57), and is analogous to heterodyne feedback for dyne measurements.
The reason for the factor of $K$ is that the effective number of detections used for the phase estimate
is $1/K$. To see this, note that the phase variance is approximately $K$.
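The nonadaptive scheme can be sketched as a small Monte Carlo simulation. The code below is an illustration under several assumptions, not the program used for the results reported here. It assumes units with $N = 1$ (so $\kappa = K$), a detection probability $\sin^2[(\varphi - \Phi_m + u\pi)/2]$, coefficients truncated at a fixed $k_{\max}$ rather than by magnitude, and a single short run. All names and default values are hypothetical.

```python
import cmath
import math
import random


def wrap(x):
    """Wrap an angle to the interval (-pi, pi]."""
    return math.atan2(math.sin(x), math.cos(x))


def simulate_nonadaptive(K, n_det=4000, n_skip=200, kmax=40, seed=0):
    """Mean-square error of arg P_{m,-1} with feedback Phi_m = Phi_0 + m K pi."""
    rng = random.Random(seed)
    size = 2 * kmax + 1
    damp = [0.5 * (i - kmax) ** 2 * K * K for i in range(size)]
    p = [0j] * size
    p[kmax] = 1.0 + 0j                          # flat initial distribution
    phi = rng.uniform(-math.pi, math.pi)        # true system phase
    Phi0 = rng.uniform(-math.pi, math.pi)       # random initial feedback phase
    total, count = 0.0, 0
    for m in range(1, n_det + 1):
        dt = rng.expovariate(1.0)               # waiting time for unit flux
        phi += K * math.sqrt(dt) * rng.gauss(0.0, 1.0)            # phase diffusion
        p = [c * math.exp(-d * dt) for c, d in zip(p, damp)]      # Gaussian damping
        Phi = Phi0 + m * K * math.pi
        # Sample the detector: u = 0 with probability sin^2((phi - Phi)/2).
        u = 0 if rng.random() < math.sin(0.5 * (phi - Phi)) ** 2 else 1
        ph = cmath.exp(1j * (Phi - u * math.pi))
        new = [0j] * size
        for i in range(size):
            lower = p[i - 1] if i > 0 else 0j
            upper = p[i + 1] if i < size - 1 else 0j
            new[i] = p[i] - 0.5 * ph.conjugate() * lower - 0.5 * ph * upper
        norm = new[kmax]
        p = [c / norm for c in new]
        if m > n_skip:
            err = wrap(cmath.phase(p[kmax - 1]) - phi)
            total += err * err
            count += 1
    return total / count


if __name__ == "__main__":
    K = 0.1
    print(simulate_nonadaptive(K), K)
```

For moderately small $K$ the estimated mean-square error should be of the order of $K$, although a single short run like this carries a statistical uncertainty of several percent.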

A minor problem with continuous adaptive measurements is that the number of probability
coefficients $P_{mk}(n_m)$ needed to determine the probability distribution for the phase rises indefinitely
with the number of detections. The narrowing effect of the varying system phase, however, means
that the probability coefficients fall approximately exponentially with $k$. The probability distribution
can therefore be approximated very accurately by keeping only a certain number of coefficients. For
the results presented here all probability coefficients with a magnitude above about $10^{-20}$ were used.



The phase variances for the two feedback schemes are plotted in Fig. 7.10. Note that, from the
previous section, the analytically predicted value of the phase variance is $K$. As can be seen, the
results for both cases are very close to this analytic result for the smaller values of $K$. For values of
$K$ close to 1 the results for the nonadaptive scheme are noticeably above the analytically predicted
values. For large values of $K$ above 1 the variance converges to 3 for both the feedback schemes.
This is what can be expected, as the system phase is randomised between detections. This means
that the measurements are equivalent to phase measurements with a single photon, for which the
Holevo phase variance is 3. The feedback has no effect, as there is no information on which to base
it.

To see the differences more clearly, the phase variances are plotted as ratios to the analytic
results in Fig. 7.11. As can be seen, the results for both measurement schemes are very close to
the analytic result for very small values of $K$, but differ increasingly for larger values. The adaptive
scheme gives phase variances that are very close to, and slightly below, the analytically predicted
values for smaller values of $K$. In contrast the results for nonadaptive measurements are all above
the analytically predicted values (for $K < 1$). For large values of $K$ the variance for both schemes is
below $K$, as the variance is converging to 3.

These results show that there is a small improvement in using an adaptive scheme over a
non-adaptive scheme; however, when the system phase is varying slowly there will be very little
improvement. This can be expected from the results in the single shot case, where there was very little
difference between the results for the adaptive and non-adaptive measurements when all photons
were in one port. This means that we do not even get an improvement by a factor of $\sqrt{2}$, as we do
in the case of continuous dyne measurements on a coherent state. If the system phase is fluctuating
rapidly we can get a significant improvement in the accuracy of the phase measurements. The
maximum improvement is about 24% for $K = 0.5$.

Figure 7.11: The phase variance as a ratio to the approximate analytic result of $K$. The results for
adaptive and nonadaptive measurements are shown as the continuous line and dotted line
respectively.



Chapter 8 

Conclusions 



The major results of this study can be described very succinctly: I have found the optimal input 
states and adaptive measurement schemes for both dyne and interferometric phase measurements. 



8.1 Input States for Dyne Measurements 

In Ch. 2 the problem of optimum input states for dyne measurements was considered. Rather than
just considering optimisation for minimum intrinsic phase variance, the problem of optimisation for
minimum phase variance under more general measurements where $h(n) = cn^{-p}$ was considered.
There are three different types of constraints on the states that were considered: an upper limit 
on the photon number, a fixed mean photon number, and squeezed states with fixed mean photon 
number. The optimum states for canonical measurements with these three constraints on the states 
have been determined prior to this study, as has the result for general measurements with an
upper limit on the photon number. In Ch. 2 the set of results was completed with analytic results
for fixed mean photon number and optimum squeezed states under general measurements. The
complete set of results is summarised in Table 8.1.



All of these results (except the exact analytic result for canonical measurements with an upper 
limit on the photon number) were verified by exact numerical calculation of the optimum states. For 
the case of canonical measurements on squeezed states it was found that the exact results did not 
agree with the approximate analytic result to order $\bar{n}^{-2}$. In addition, corrections to this analytic
expression derived from the results in [fl5| did not produce any better agreement. 

For the states with an upper limit on the photon number optimised for minimum phase variance
under general measurements, it was found that although there was good agreement with the
analytic results, larger photon numbers were required for good agreement than was claimed in [35].
This is because in [35] the photon numbers required for good agreement were estimated from the
approximation made in omitting a boundary condition. The main approximation made is actually
the linearisation of the equations, which means that the photon numbers were underestimated in
[35].

An additional correction term of $3c^2N^{-2p}$ was found for this case. This term is specific to the
Holevo phase variance, and is only significant for mark I measurements. For mark I measurements
this term is required in order to obtain agreement to second order with the numerical results. Similar
terms were found for the other two constraints upon the state.

Table 8.1: The asymptotic formulae for minimum Holevo phase variance for the canonical distribution
and general measurements under three different constraints on the states. The results in bold
are those that are original to this study.

Constraint               Canonical                               General measurements
Maximum photon number    $\pi^2/N^2$                             $2cN^{-p} + |z_1|(2cp)^{2/3}N^{-2(1+p)/3} + 3c^2N^{-2p}$
Mean photon number       $1.893606/\bar{n}^2$                    $2c\bar{n}^{-p} + \sqrt{cp(p+1)}\,\bar{n}^{-p/2-1} + 3c^2\bar{n}^{-2p}$
Squeezed states          $(\log\bar{n} + \Delta)/(4\bar{n}^2)$   $2c\bar{n}^{-p} + \sqrt{cp(p+1)}\,\bar{n}^{-p/2-1} + 3c^2\bar{n}^{-2p}$
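
To see why the $3c^2N^{-2p}$ term matters mainly for mark I measurements, it helps to compare the
powers of $N$ carried by the second and third terms of the general-measurement formula for an upper
limit $N$ on the photon number. The sketch below is illustrative only. It assumes that $z_1$ denotes
the first zero of the Airy function ($z_1 \approx -2.3381$), which is not stated in this chapter, and that
$p = 1/2$, 1 and 3/2 correspond to mark I, heterodyne and mark II measurements respectively. The value
of $c$ and the function names are placeholders rather than values taken from the text.

```python
AIRY_ZERO_MAGNITUDE = 2.338107  # |z_1|, assumed to be the first Airy zero

def variance_terms(c, p, N):
    """Three terms of the asymptotic formula for an upper limit N on the photon number."""
    leading = 2 * c * N ** (-p)
    second = AIRY_ZERO_MAGNITUDE * (2 * c * p) ** (2 / 3) * N ** (-2 * (1 + p) / 3)
    third = 3 * c ** 2 * N ** (-2 * p)
    return leading, second, third

def exponents(p):
    """Powers of N carried by the second and third terms."""
    return -2 * (1 + p) / 3, -2 * p

for p in (0.5, 1.0, 1.5):  # assumed exponents for mark I, heterodyne, mark II
    second_exp, third_exp = exponents(p)
    _, second, third = variance_terms(c=0.25, p=p, N=10_000)  # c is a placeholder
    print(f"p={p}: N^{second_exp:.3f} vs N^{third_exp:.3f}, third/second = {third / second:.2e}")
```

For $p = 1/2$ both terms scale as $N^{-1}$, so the third term is of the same order as the second; for
larger $p$ the third term falls off faster and becomes negligible at large $N$.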

For the case of general measurements on states with fixed mean photon number, it was found 
that the results for both the general case and the squeezed state case agreed well with the analytic 
result. In addition the results for these two cases agreed very closely with each other; more closely 
than with the analytic approximation. In fact, when the states were compared, it was found that 
they were extremely close, although there were small systematic differences, particularly in the tails 
of the distribution. 

These results mean that it should be possible to experimentally produce very good approxima- 
tions of the states that are optimised for minimum phase variance under the various measurement 
schemes by using squeezed states. This is an advantage in theoretical work also, as squeezed states 
are more easily treated numerically. Only the two squeezing parameters need be kept track of, rather 
than the entire state, resulting in much faster calculation times. The exception to this is canonical 
measurements, for which the results for general and squeezed states have different scalings. The
ratio between the two results only scales as $\log\bar{n}$, however, so the difference becomes significant only
for very large photon numbers.

8.2 Optimum Dyne Measurements 

The next issue considered was the problem of how to make optimum measurements of these states.
Prior to this study it was known that the minimum phase variance introduced by an arbitrary dyne
measurement scheme was that of an optimised squeezed state, with variance scaling as $\log\bar{n}/\bar{n}^2$.
The best adaptive dyne measurement scheme previously considered, the mark II scheme, only gave
scaling of $\bar{n}^{-1.5}$. In this study I have found that it is possible to obtain results that are very close
to the theoretical limit using a feedback phase estimate of

$$\arg\left(C_v^{1-\epsilon(v)} A_v^{\epsilon(v)}\right) \tag{8.1}$$

with

[Eq. (8.2) is not recoverable from the source.]

If this is used without corrections, the variance as estimated from the variance of the samples
is very close to the theoretical limit. Unfortunately this method of estimating the variance does
not take account of very low probability results with large phase error. This contribution can be
estimated from the values of A and B at the end of the measurement, and it is quite significant for
this uncorrected phase feedback scheme.

It is possible to correct the feedback scheme in order to minimise this problem. It was found that
very good results were obtained by using a correction that brings the value of $B_v$ directly towards
the estimated optimum value. This is used only at the very end of the measurement, and then only
if $|B_v|$ is too far above optimum. Using this correction, the introduced phase variance as estimated
from the values of A and B is only about 4% above the optimum value, even for the largest photon
number considered.

It was found that the phase variance of the samples when this corrected feedback was used was
slightly worse than that estimated from the values of A and B. This appears to be because of
the large scatter in the photon number of the squeezed states in the POM. The data points with
smaller photon number give a disproportionately large contribution to the phase variance, as the
phase variance for optimum squeezed states rises very rapidly as the photon number is reduced.
This is unavoidable, as any state with a small phase variance will have a large uncertainty in the
photon number. This is the case both for the input state and the squeezed state in the POM. It
is therefore reasonable to claim that the corrected phase measurement scheme is about 4% above
what is theoretically possible.

Under some circumstances, it is actually possible to surpass the theoretical limit based on the
POM. This is possible if the input state is known accurately. For coherent states, it is possible to
reduce the introduced phase variance to about a quarter of the theoretical limit. Unfortunately, for
states with reduced phase uncertainty, this level of improvement does not appear to be possible.
The improvement found for squeezed states was only around 10%.

8.3 Time Delays 

All the adaptive feedback schemes considered are adversely affected by time delays. This was 
considered prior to this study in Ref. [35], which predicted the excess phase variance for mark
I and II measurements in a very approximate way. In this study the excess phase variance due to 
time delays was determined in a far more rigorous way, and verified by numerical calculations. 

In Ref. [35] it was predicted that the excess phase variance for mark I measurements was $\tau/2$.
In this study it was found that this is the correct result, provided the phase estimate at the end
of the measurement is the final value of the running phase estimate. This was found by repeating
the derivation of Ref. [35] in a more rigorous way, and the same result was obtained. Fitting to the
numerical results gave scaling constants close to the theoretical value of 1/2, particularly for the larger
photon numbers. There was some discrepancy for the smallest photon number considered, which
can be expected as the approximation is in the limit of large $\alpha$.

If, rather than using the final value of the intermediate phase estimate as the final phase estimate,
arg A is used, the phase variance actually decreases with the time delay. This is because the
intermediate phase estimates have a larger variance, resulting in the mark I measurements being
closer to heterodyne measurements.

It was found that the scaling of the excess phase variance for mark II measurements predicted
in [35] was correct, though the scaling constant was incorrect. When the derivation of Ref. [35] was
repeated more rigorously, it was found that completely different terms were obtained, casting some
doubt on the method used in Ref. [35]. The terms found here are unusable, as they depend explicitly
on the initial conditions of the integration.

In order to avoid this problem, an alternative derivation was considered that gives a lower limit
to the total introduced phase variance with a given time delay, rather than the approximate excess
phase variance due to the time delay. This result was approximately $\tau/(8\bar{n})$, as opposed to the
introduced variance of $\tau/(2\bar{n})$ found in Ref. [35]. This result can be expected to give the introduced
phase variance accurately if both the photon number and time delay are relatively large. In contrast,
the result found in Ref. [35] was for the limit of small $\alpha\tau$.

Numerically it was found that the introduced phase variance converges to this theoretical limit in 
the three different cases with unsimplified feedback considered. For the case of simplified feedback, 
however, the phase variance was far higher. For larger time delays, rather than converging to the 
theoretical lower limit, the phase variance converged to the variance for heterodyne measurements. 
It was found to be possible to correct the feedback to improve on these results, but the variance was 
still much higher than for unsimplified feedback, and did not converge to the theoretical limit. 

This means that, if there is any significant time delay in the system, the simplified analog feedback 
will give far worse results than the exact, unsimplified feedback. This makes the more sophisticated 
feedback schemes considered in Ch. 3 more attractive, as one of the main advantages of the mark II
measurement scheme was that it allowed the simplified feedback to be used. 

8.4 Input States for Interferometry 

In Ch. 5 optimum input states for interferometry were considered. When the total photon number
is fixed, this problem is equivalent to the dyne case with an upper limit on the photon number.
This problem has a simple solution, and there is a correspondingly simple solution in the case of
interferometry. This state has a canonical phase variance scaling as $N^{-2}$, as compared to $N^{-1}$ when
all the photons are incident on one port.
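
The two scalings can be illustrated numerically. The sketch below rests on three assumptions that
are not stated in this chapter: that for canonical measurements the sharpness of a state with real
coefficients $\psi_0, \dots, \psi_N$ is $\sum_n \psi_n\psi_{n+1}$, that the simple optimum solution has the
form $\psi_n \propto \sin[(n+1)\pi/(N+2)]$, and that the state with all photons in one port has binomial
coefficients in the same basis. The function names are likewise illustrative.

```python
import math

def holevo_variance(coefficients):
    """Canonical Holevo variance for real state coefficients psi_0..psi_N."""
    sharpness = sum(a * b for a, b in zip(coefficients, coefficients[1:]))
    return 1.0 / sharpness ** 2 - 1.0

def optimal_state(N):
    """Assumed optimum state: psi_n proportional to sin((n+1)pi/(N+2))."""
    norm = math.sqrt(2.0 / (N + 2))
    return [norm * math.sin((n + 1) * math.pi / (N + 2)) for n in range(N + 1)]

def one_port_state(N):
    """Assumed binomial coefficients for all N photons entering one port."""
    return [math.sqrt(math.comb(N, n) / 2 ** N) for n in range(N + 1)]

for N in (10, 100, 1000):
    v_opt = holevo_variance(optimal_state(N))
    v_one = holevo_variance(one_port_state(N))
    print(N, v_opt * N ** 2, v_one * N)  # both products should level off
```

Under these assumptions the Holevo variance of the optimum state is $\tan^2[\pi/(N+2)]$, so $N^2$ times
the variance settles to a constant, while for the one-port state it is $N$ times the variance that
settles. For a single photon the binomial state gives a Holevo variance of 3.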

The optimum state requires contributions from a large number of input photon eigenstates; however, it
requires significant contributions from only about 10. This means that although the complete state
will be very difficult to produce, it should be less difficult to produce close approximations of it.
Unfortunately, as the photon number is increased, more and more of these eigenstates are required
to give a variance that is close to that for the exact state.

An alternative state that has been considered in previous work is one where equal photon
numbers are incident on each input port. This state is the photon number eigenstate that has the
highest contribution to the optimum state, and can therefore be considered to be an approximation
of it. Unfortunately this state has a canonical phase variance that scales as $N^{-1/2}$. This indicates
that the phase uncertainty should scale as $N^{-1/4}$. This is an extraordinary result considering that
previous work indicated that this state should give a phase uncertainty scaling as $N^{-1}$.

This discrepancy was shown to be due to the tails of the phase distribution, which give the main 
contribution to the phase variance. The previous work, in contrast, only considered the central peak 
of the distribution. To compare these two states more thoroughly, these states were each evaluated 
under several different measures of the uncertainty. It was found that the phase uncertainty for the 
$|j0\rangle_z$ state scaled as $N^{-1}$ for all of the measures of the phase uncertainty except the square root of
the variance (standard and Holevo).

The optimal states gave smaller scaling constants under all of the measures except two: the
inverse-of-maximal-value and the Fisher length. There are reasons to consider the results given by
these measures as being slightly misleading, however. In practice, the $|j0\rangle_z$ state will give results
with small errors most of the time, but a small number of results with very large errors. This means
that results obtained using this state must be carefully analysed to avoid problems due to the results
with large error. For example, if we simply take the mean of a number of results obtained using this
state, the uncertainty in this mean will scale as $N^{-1/4}$ rather than $N^{-1}$.
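
The effect of rare large errors on the variance can be seen in a toy model. The sketch below is not
the phase distribution of the $|j0\rangle_z$ state: it simply assumes a mixture in which a fraction
`tail_weight` of the results is spread uniformly over $[-\pi, \pi)$ while the rest lie in a narrow
central peak of standard deviation `peak_width`. All names and numbers are hypothetical, chosen only
to show how the tails can dominate the variance while the central peak stays narrow.

```python
import math

def mixture_variance(peak_width, tail_weight):
    """Variance of a zero-mean mixture: narrow peak plus uniform tail on [-pi, pi)."""
    peak_part = (1.0 - tail_weight) * peak_width ** 2
    tail_part = tail_weight * math.pi ** 2 / 3.0  # variance of the uniform tail
    return peak_part + tail_part

def tail_fraction_of_variance(peak_width, tail_weight):
    """Share of the total variance contributed by the uniform tail."""
    tail_part = tail_weight * math.pi ** 2 / 3.0
    return tail_part / mixture_variance(peak_width, tail_weight)

peak_width = 0.01   # hypothetical width of the central peak
tail_weight = 0.01  # hypothetical probability of a large-error result
total = mixture_variance(peak_width, tail_weight)
print("standard deviation:", math.sqrt(total))
print("tail share of variance:", tail_fraction_of_variance(peak_width, tail_weight))
```

With these numbers almost all of the variance comes from the 1% of results in the tail, so the
standard deviation is far larger than the width of the central peak. A measure based on the central
peak alone would report a much smaller uncertainty.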

8.5 Optimum Interferometric Measurements 

In order to make interferometric phase measurements with the minimum possible phase uncertainty, 
the other two areas to optimise are the feedback phases used and the final phase estimate. It was 
shown in Ch. 6 that it is reasonably simple to determine the final phase estimate that gives the
minimum uncertainty. Determining the optimum feedback phases to use is a far more difficult 
problem. 

Solving this problem for $N$ photons is a minimisation problem with approximately $2^{N-2}$ independent
variables. It is possible to solve it numerically; however, the rapidly increasing number
of variables means that this is only feasible for photon numbers up to about 12. Even then there 
is a small possibility of a better result for a significantly different combination of numbers, as the 
numerical technique finds a local minimum, but does not prove that it is a global minimum. 

This numerical minimisation was performed for input states optimised for minimum phase variance
under canonical measurements. It is also possible to solve simultaneously for the feedback
phases and the input state. This is only a slightly more difficult problem, as the number of state
coefficients increases linearly with $N$. Improvements over the optimum input states were found for
every photon number above 2; however, these improvements were only very small, less than 1%.

An alternative approach, rather than trying to determine the feedback phase that will minimise
the variance of the final phase estimate, is to determine the phase that will simply minimise the phase
uncertainty after the next detection. This approach was used in Ch. 6, and this feedback scheme gave
results that were the same as optimum for up to 4 photons with optimum input states. For more than 4
photons the phase variances obtained were only slightly higher than for the numerically optimised
feedback. For the maximum photon number for which the optimum feedback was determined, 12,
the increase was only about 3.6%.

This feedback scheme was tested on optimum input states for higher photon numbers up to 
1600, and it was found that the phase variance was only slightly higher than the canonical phase 
variance. That means that this feedback scheme must be very close to optimum for these larger 
photon numbers as well. Unfortunately, the ratio of the phase variance under this feedback scheme 
to the canonical phase variance increases fairly systematically with photon number, indicating that 
the introduced phase variance is not quite scaling as $N^{-2}$. It is quite likely that the scaling is
$\log N/N^2$, as for dyne measurements, though it would require calculations at much larger photon
numbers to confirm this.

For input states other than the optimum input state, this feedback scheme gave variances almost 
indistinguishable from the canonical phase variance. Of particular interest is the $|j0\rangle_z$ state, for
which the canonical phase uncertainty scales as $N^{-1}$ for measures other than the square root of
the variance. For this state, the 2/3 confidence interval scales approximately as $N^{-0.69}$ under this
measurement scheme. This is not as good as the $N^{-1}$ scaling for the canonical distribution, but it is
an improvement on the result if all photons are in one port. Note, however, that this measurement
scheme was designed with the aim of minimising the phase variance, not any other measure of phase 
uncertainty. This means that the feedback phases and phase estimates are not necessarily close to 
optimum for minimising the confidence interval. Minimising the confidence interval would require 
quite a different approach. 

Alternative phase feedback schemes were also considered in Ch. 6. A nonadaptive scheme was
considered, where rather than using information from the detections to determine the feedback phase,
the feedback phase was merely varied linearly. This is equivalent to heterodyne measurements
for the single mode case, in that all values of the feedback phase are used with roughly equal
probability. This scheme gave a phase variance scaling as $N^{-1}$ for optimum input states, similarly
to the heterodyne scheme for single mode measurements. When used on the state with all photons
in one port (which has a canonical phase variance scaling as $N^{-1}$) there was only a marginal increase
in the phase variance over the adaptive scheme.

An alternative adaptive feedback scheme based on phase estimates was found to give far poorer
results than the feedback scheme of Sec. 6.3. Although most of the results had small error, it gave a
small proportion of results with very large error, resulting in a high phase variance. On average the
phase variance was between that for the feedback scheme of Sec. 6.3 and the nonadaptive scheme.

For 1 or 2 photons, the phase variances obtained by the feedback scheme of Sec. 6.3 were identical
to the phase variances for canonical measurements, for any input state. This means that, provided we 
have no more than 2 photons, it is possible to perform measurements as good as canonical. For 3 or 4 
photons, however, the phase variance was higher than canonical. As the feedback phases are optimum 
for these photon numbers, this means that it is not possible to make canonical measurements in 
general, even with perfect photodetectors and the best possible feedback phases. This result was 
checked by evaluating the entire range of feedback phases, in order to demonstrate that the feedback 
phases used gave a global minimum to the phase variance. 

If arbitrary states are considered, this feedback scheme is no longer optimum for 3 or 4 photons. In 
addition, it is possible to obtain phase variances that are below canonical. This is not a contradiction, 
as the variance is not necessarily minimised by having all the $H_{nm}$ equal to 1. For some states the
optimal POM should have complex values of $H_{nm}$. Using this corrected canonical POM, the feedback
scheme of Sec. 6.3 no longer gives variances as small as canonical for all 2 photon states.



8.6 Continuous Feedback 

The last area considered in this study was the problem of continuous phase measurements, where 
the phase is being varied and the aim is to follow this variation with the minimum possible excess 
uncertainty. For this problem, we cannot use the optimum input states considered in other chapters, 
as these are based on single-shot measurements with a limited number of photons. Instead, I 
considered a continuous coherent state for dyne measurements, and a state with all photons in one 
port for the interferometric case. These two cases are extremely similar, with both having canonical 
phase variances proportional to $N^{-1}$. In addition, continuous squeezed states for dyne measurements
were considered. In all cases the phase variation considered was Gaussian diffusion. 

In the case of dyne measurements it was found that good results were obtained using arg $A_t$
feedback, similarly to mark I and II single-shot measurements. In the continuous case, the feedback
simplifies to a very simple form, where the feedback phase is adjusted proportional to the
photocurrent. This form is even simpler than for the single-shot case. When the correct proportionality
constant is selected, a minimum equilibrium phase variance is found that is proportional to $\alpha^{-1}$.
This is much poorer scaling than the case where the system phase is constant, where the variance
scales as $\alpha^{-2}$ (or $\bar{n}^{-1}$). This is because, as the photon number is increased, data from a shorter time
is used (to reduce the contribution to the variance from the varying system phase), with the result
that the effective photon number that is used increases proportionally to $\alpha$ rather than $\alpha^2$.

When heterodyne feedback is used, rather than adaptive feedback, the phase variance is increased
by a factor of $\sqrt{2}$. This is less than the difference for the single-shot case for the same reason that
the scaling is different: when a more accurate measurement can be made, the time interval from
which data is used is reduced. It was also found that the numerical results matched these theoretical
predictions very accurately, particularly for the results with smaller phase variances.
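
The scaling argument above can be made concrete with a toy trade-off, which is an assumption
introduced here rather than the model derived in this study. Suppose that using data from a time
window $T$ gives a measurement contribution $a/(\alpha^2 T)$ to the variance and a contribution $\kappa T$
from the diffusing system phase, where the constants `measurement_cost` ($a$) and `kappa` are
hypothetical. Minimising over $T$ gives a variance proportional to $\alpha^{-1}$, and doubling the
measurement cost (taken here to stand in for heterodyne detection) raises the minimum by $\sqrt{2}$
rather than 2.

```python
import math

def window_variance(T, alpha, measurement_cost=0.25, kappa=1.0):
    """Toy variance: measurement term a/(alpha^2 T) plus diffusion term kappa*T."""
    return measurement_cost / (alpha ** 2 * T) + kappa * T

def minimum_variance(alpha, measurement_cost=0.25, kappa=1.0):
    """Minimise over the window T by golden-section search on log T."""
    lo, hi = math.log(1e-12), math.log(1e6)
    ratio = (math.sqrt(5.0) - 1.0) / 2.0

    def objective(x):
        return window_variance(math.exp(x), alpha, measurement_cost, kappa)

    for _ in range(200):
        x1 = hi - ratio * (hi - lo)
        x2 = lo + ratio * (hi - lo)
        if objective(x1) < objective(x2):
            hi = x2  # minimum lies in [lo, x2]
        else:
            lo = x1  # minimum lies in [x1, hi]
    return objective(0.5 * (lo + hi))

adaptive = minimum_variance(alpha=100.0)
heterodyne_like = minimum_variance(alpha=100.0, measurement_cost=0.5)
print(adaptive * 100.0)            # alpha times the minimum variance
print(heterodyne_like / adaptive)  # ratio of the two minima
```

In this toy model the optimum window is $T = \sqrt{a/\kappa}/\alpha$, so the number of photons actually
used, $\alpha^2 T$, grows only in proportion to $\alpha$, and the minimum variance is $2\sqrt{a\kappa}/\alpha$.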

For the case of dyne measurements on continuous squeezed states, the situation is considerably 
more complicated. Rather than just a single constant that must be varied to find the minimum 
phase variance there are three. Nevertheless, it is still possible to obtain an approximate analytic 
result. It was found that the minimum phase variance should vary as $\alpha^{-4/3}$. This is only slightly
better scaling than the case for coherent states.

Numerically it was found that the scaling of the minimum phase variance was very close to the
$\alpha^{-4/3}$ scaling found analytically. The numerical results were well above the analytic result, however,
on average by more than a factor of two. This appears to be due to the contribution from large phase error
results, which were ignored in the analytic treatment. Numerically it was found that when the 
parameters were varied beyond the optimal values found, the variance increased due to these large 
error results. 

The case for interferometry is more difficult to treat, as it does not give a simple result for the 
feedback. The feedback used was based on minimising the variance after the next detection, similarly 
to the single-shot case. Nevertheless, it was found that it is possible to determine an approximate 
theory that agrees reasonably well with the numerical results. Similarly to the dyne case with a coherent
state, the phase variance is proportional to $N^{-1/2}$, where $N$ is the photon flux. When a linearly
changing feedback phase was used (analogous to the heterodyne scheme), it was found that the 
phase variance is above that for the adaptive feedback, but the difference is only small, particularly 
for the smaller phase variances. This is as can be expected, as the difference is very small for large 
photon numbers in the single-shot case. 

8.7 Questions for Future Research 

Although the main aims of this project, finding the optimum states and measurement schemes 
for phase measurements, have been achieved, this project has also raised a number of unanswered 
questions that are possible future directions for research. 

8.7.1 Optimum Dyne Measurements 

In Ch. 3 a feedback scheme was found that produced dyne measurements extremely close to the
theoretical limit. Although there are good qualitative reasons for expecting a feedback scheme of 
this type to be close to optimal, there is no rigorous justification for why this scheme works. This 
question is only of theoretical interest, as it would only tell us why a feedback scheme that has 
already been found works, rather than leading to any better phase measurements. 

Another question is raised by the results showing that the theoretical limit does not hold when 
different final phase estimates are used. An alternative limit was found in the case of coherent states, 
but there is no corresponding result for the case of squeezed states or more general states. When the 
feedback that gives results close to the theoretical limit for the usual arg C phase estimates was used,
it was found that it is only possible to improve the results slightly using better phase estimates. It 
is conceivable, though unlikely, that it is possible to do better using some other feedback scheme. 

A more promising way of improving upon the theoretical limit would be to use nonlinear elements. 
The basic reason why there is an introduced phase variance is that the phase is not measured
directly. The best we can do is to measure a quadrature of the phase, which is proportional to 
the sine of the phase. The introduced phase variance is due to the fact that the sine function is 
nonlinear. In principle, it should be possible to compensate for this nonlinearity using nonlinear 
optics.

Another possible area for further research is to consider alternative measures of the phase
uncertainty, as was done for interferometric measurements. For the Holevo phase variance, it is necessary
to take account of very low probability, large error results in order to obtain an accurate estimate 
of the variance. This is unnecessary for other measures of the uncertainty with less emphasis on the 
tails. Nevertheless, I do not anticipate that this would give qualitatively different results. 

8.7.2 Optimum Interferometry 

There are a number of promising areas for future study in the area of interferometry. One is 
the question of whether the theoretical limit for dyne measurements also holds for interferometric 
measurements. There is a great deal of similarity between interferometric measurements and dyne 
measurements, as in both cases we are measuring the phase difference between two modes combined 
at a beam splitter. In principle it should be possible to take the large amplitude limit in a similar 
way as for dyne measurements, in order to get a similar theoretical limit. 

The main problem with taking this limit is that we cannot assume that one mode is of much 
larger amplitude than the other, as we can for dyne measurements. Some other complications are 
that there will be quantum correlations between the modes for interferometry, and the final phase 
estimate that is being used is not arg C. These factors mean that a theoretical limit would be very 
difficult to find, and it need not be the same as for the case of dyne measurements. 

Another area that can be considered is feedback where the feedback phase is selected so as to 
minimise the phase variance two or more detections in advance, rather than just one. This should 
improve on the feedback scheme that minimises the variance after the next detection. In addition, 
if only very minor improvements were obtained for feedback that minimises two or three detections 
ahead, then this would indicate that the feedback is very close to optimum. 

Unfortunately this approach would be very computationally intensive, as there does not appear 
to be any analytic solution for the feedback phase. The feedback phases would have to be determined 
numerically, making the calculations far more time consuming, though not as much as for the case 
where the feedback phases are chosen to minimise the final phase variance. 

Another promising direction is using other measures of the phase uncertainty. In particular 
it would be interesting to optimise the measurements for minimum entropic length, as this would 
correspond to maximum information. It is conceivable that minimising the entropic length one 
detection in advance would also minimise the final entropic length. This problem would also be very 
computationally intensive, as it would require numerical integrals to evaluate the entropic length. 

A more difficult problem is performing measurements that have zero error probability, as in 
Ref. [59]. The measurements considered in Ref. [59] are not physically possible, but it may be
possible to perform measurements with zero error probability using feedback. The states considered
in Ref. [59] do not have a fixed total photon number in the two modes, as was assumed in this study,
so this theory would have to be substantially modified in order to consider these states. 

I will lastly mention that it should be possible to generalise the theory considered in this study to 
phase measurements with an arbitrary number of modes. These are discussed in Ref. [60], and here it
is the phase differences between the modes that we wish to measure. This is a difficult problem, as for 
example it does not seem to be possible to generalise the optimal states for two-mode interferometry 
to this case in a simple way. 



Appendix A 

Longer Derivations 



In this appendix I give some of the longer derivations that are too lengthy to present in the main 
text. 



A.1 Perturbation Theory for Optimum Dyne States

We wish to solve Eq. (2.91) of Sec. 2.2.2 by perturbation theory, with the unperturbed Hamiltonian
and perturbation term given by Eqs (2.94) and (2.95) respectively. The unperturbed solution is

$$\psi_j^{(0)}(n) \propto \exp\left[-\frac{\sqrt{f_2}}{2}(n-n_0)^2\right] H_j\left[f_2^{1/4}(n-n_0)\right], \tag{A.1}$$

where $H_j$ are Hermite polynomials. This is the standard result for the harmonic potential, and
is easily derived from the properties of Hermite polynomials given in [46]. Similarly to the other
derivations given in Ch. 2, the boundary condition is $\psi(0) = 0$. The unperturbed solution, and the
more accurate perturbed solutions below, will approximately obey this boundary condition when
$\sqrt{f_2}\,n_0^2 \gg 1$. For this to be the case, $p < 2$ is required, in addition to $n_0 \gg 1$.
The energy eigenvalues are

$$E_j^{(0)} = (2j+1)\sqrt{f_2}.$$



From perturbation theory we then have

where $|j\rangle$ is the state corresponding to $\psi_j^{(0)}(n)$. In terms of number states

We can rewrite the perturbation as $H_1 = b\xi^3$, where

In terms of $\xi$ the unperturbed eigenstates are

$$\psi_j^{(0)}(\xi) = \frac{1}{\sqrt{2^j j!\sqrt{\pi}}}\, e^{-\xi^2/2} H_j(\xi).$$

The factor of $f_2^{1/8}$ has been omitted here, so these states are normalised when integrated with respect
to $\xi$.

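The lowest energy eigenvalue and eigenstate, corresponding to the maximum value of v and
therefore $\langle\cos\phi\rangle$, can be expressed as

$$E \approx E_0^{(0)} + b\langle 0|\hat\xi^3|0\rangle - \frac{b^2}{2\sqrt{f_2}}\sum_{k=1}^{\infty}\frac{|\langle k|\hat\xi^3|0\rangle|^2}{k}, \tag{A.9}$$

$$\psi(n) \approx \psi_0^{(0)}(n) - \frac{b}{2\sqrt{f_2}}\sum_{k=1}^{\infty}\frac{\langle k|\hat\xi^3|0\rangle}{k}\,\psi_k^{(0)}(n). \tag{A.10}$$

Here $\hat\xi$ is the operator that transforms the state $|j\rangle$ in the same way as $\xi$ transforms the function
$\psi_j^{(0)}(\xi)$.

The first four Hermite polynomials are

$$H_0(\xi) = 1, \tag{A.11}$$
$$H_1(\xi) = 2\xi, \tag{A.12}$$
$$H_2(\xi) = 4\xi^2 - 2, \tag{A.13}$$
$$H_3(\xi) = 8\xi^3 - 12\xi. \tag{A.14}$$

Therefore the first four unperturbed eigenstates are

$$\psi_0^{(0)}(\xi) = \pi^{-1/4} e^{-\xi^2/2}, \tag{A.15}$$
$$\psi_1^{(0)}(\xi) = \pi^{-1/4}\sqrt{2}\,\xi\, e^{-\xi^2/2}, \tag{A.16}$$
$$\psi_2^{(0)}(\xi) = \pi^{-1/4}\frac{1}{\sqrt{2}}\left(2\xi^2 - 1\right) e^{-\xi^2/2}, \tag{A.17}$$
$$\psi_3^{(0)}(\xi) = \pi^{-1/4}\frac{1}{\sqrt{3}}\left(2\xi^3 - 3\xi\right) e^{-\xi^2/2}. \tag{A.18}$$

It is therefore easy to show that

$$\xi^3\psi_0^{(0)}(\xi) = \frac{3}{2\sqrt{2}}\,\psi_1^{(0)}(\xi) + \frac{\sqrt{3}}{2}\,\psi_3^{(0)}(\xi), \tag{A.19}$$

or, in the alternative notation

$$\hat\xi^3|0\rangle = \frac{3}{2\sqrt{2}}\,|1\rangle + \frac{\sqrt{3}}{2}\,|3\rangle. \tag{A.20}$$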
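
The action of $\hat\xi^3$ on the ground state can also be checked numerically. The sketch below assumes the
standard ladder-operator representation $\hat\xi = (a + a^\dagger)/\sqrt{2}$ in a truncated basis of the
unperturbed eigenstates; the truncation size and the helper names are choices made here for
illustration.

```python
import math

DIM = 8  # truncation of the basis |0>, ..., |7>; ample for xi^3 acting on |0>

def xi_matrix(dim):
    """Matrix of xi = (a + a_dagger)/sqrt(2) in the harmonic oscillator basis."""
    xi = [[0.0] * dim for _ in range(dim)]
    for n in range(dim - 1):
        element = math.sqrt((n + 1) / 2.0)  # <n+1|xi|n> = <n|xi|n+1>
        xi[n + 1][n] = element
        xi[n][n + 1] = element
    return xi

def mat_vec(matrix, vector):
    """Matrix-vector product for plain nested lists."""
    return [sum(row[k] * vector[k] for k in range(len(vector))) for row in matrix]

def xi_cubed_on_ground_state(dim=DIM):
    """Components <k|xi^3|0> for k = 0, ..., dim - 1."""
    state = [1.0] + [0.0] * (dim - 1)
    xi = xi_matrix(dim)
    for _ in range(3):
        state = mat_vec(xi, state)
    return state

components = xi_cubed_on_ground_state()
print(components[1], 3 / (2 * math.sqrt(2)))  # the two numbers should agree
print(components[3], math.sqrt(3) / 2)        # the two numbers should agree
```

Only the components along $|1\rangle$ and $|3\rangle$ are non-zero, which is why the perturbation sums
reduce to two terms.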



From this it is evident that the only non-zero terms in the sums in ( A.9 ) are (1|£ 3 |0) = 3/(2v2) 
and (3|£ 3 |0) = y/3/2. This then gives the lowest energy eigenvalue and eigenstate as 



E ^E^-b^f-^, 
ip (ri) ~ ipo°\n) - 



3V4 



(0) 



(0) 



(A.21) 
(A.22) 



The correction term to the eigenvalue is of order $n_0^{-2}$, which is sufficiently small to be omitted. The
eigenstate can alternatively be expressed as

\[ |\psi_0\rangle \approx |0\rangle - \frac{3b}{4\sqrt{2f_2}}\,|1\rangle - \frac{b}{4\sqrt{3f_2}}\,|3\rangle. \tag{A.23} \]
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Therefore the expectation value of the photon number is 



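\[ \bar n \approx \left[\langle 0| - \frac{3b}{4\sqrt{2f_2}}\langle 1| - \frac{b}{4\sqrt{3f_2}}\langle 3|\right] \hat n \left[|0\rangle - \frac{3b}{4\sqrt{2f_2}}|1\rangle - \frac{b}{4\sqrt{3f_2}}|3\rangle\right]. \tag{A.24} \]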



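In order to evaluate this, note first that we wish to keep terms only up to first order in the
perturbation, so we can simplify this to

\[ \bar n \approx n_0 + \frac{1}{(f_2)^{1/4}}\left\{ \langle 0|\hat\xi|0\rangle - \left[\frac{3b}{4\sqrt{2f_2}}\langle 1| + \frac{b}{4\sqrt{3f_2}}\langle 3|\right]\hat\xi|0\rangle - \langle 0|\hat\xi\left[\frac{3b}{4\sqrt{2f_2}}|1\rangle + \frac{b}{4\sqrt{3f_2}}|3\rangle\right] \right\}. \]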



From the above listing of the eigenstates it can be seen that 



e4 0) (0 



CIO) 



1 
1 

71 



|i>- 



= |3) 
(A.25) 

(A.26) 
(A.27) 



Therefore the only non-zero matrix element above is $\langle 1|\hat\xi|0\rangle$. Using this gives

\[ \bar n \approx n_0 - \frac{3b}{4(f_2)^{3/4}} \approx n_0 + \frac{p+2}{4\sqrt{cp(p+1)}}\, n_0^{p/2}. \tag{A.28} \]

As we are assuming $p < 2$,

\[ n_0 \approx \bar n\left[1 - \frac{p+2}{4\sqrt{cp(p+1)}}\, \bar n^{p/2-1}\right], \tag{A.29} \]

so the mean photon number is close to $n_0$, justifying the expansion around $n_0$.

A.2 Derivation for Optimum Squeezed States



Next the result (2.149) of Sec. 2.3.3 will be derived in a more rigorous way. Recall that the number
state coefficients for squeezed states are given by an expression that depends on Hermite polynomials
(2.121). Hermite polynomials satisfy the recursion relation [46]

\[ H_{n+1}(x) - 2xH_n(x) + 2nH_{n-1}(x) = 0. \tag{A.30} \]
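The Hermite recursion relation can be checked independently of any recursive definition. The sketch below evaluates the physicists' Hermite polynomials from their explicit finite sum and returns the left-hand side of the recursion, which should vanish up to rounding. The function names `hermite` and `recursion_residual` are illustrative only.

```python
import math

def hermite(n, x):
    """Physicists' Hermite polynomial H_n(x) from its explicit finite sum."""
    return math.factorial(n) * sum(
        (-1) ** m * (2 * x) ** (n - 2 * m)
        / (math.factorial(m) * math.factorial(n - 2 * m))
        for m in range(n // 2 + 1)
    )

def recursion_residual(n, x):
    """H_{n+1}(x) - 2x H_n(x) + 2n H_{n-1}(x), expected to be zero for n >= 1."""
    return hermite(n + 1, x) - 2 * x * hermite(n, x) + 2 * n * hermite(n - 1, x)

for n in range(1, 6):
    print(n, recursion_residual(n, 0.7))
```

Because the polynomials are built from the closed-form sum rather than from the recursion itself, a vanishing residual is a genuine consistency check.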
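This can be used to derive the recursion relation for number state coefficients:

\[ \langle n+1|\alpha,\zeta\rangle\,\mu\sqrt{n+1} - \langle n|\alpha,\zeta\rangle\,\beta + \langle n-1|\alpha,\zeta\rangle\,\nu\sqrt{n} = 0. \tag{A.31} \]

Rearranging this and squaring gives

\[ |\langle n+1|\alpha,\zeta\rangle|^2\mu^2(n+1) = |\langle n|\alpha,\zeta\rangle|^2\beta^2 + |\langle n-1|\alpha,\zeta\rangle|^2\nu^2 n - 2\langle\alpha,\zeta|n\rangle\langle n-1|\alpha,\zeta\rangle\,\beta\nu\sqrt{n}. \tag{A.32} \]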
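Multiplying this by $n^k$ and summing gives

\[
\begin{aligned}
\sum_{n=1}^{\infty} |\langle n+1|\alpha,\zeta\rangle|^2 \mu^2 (n+1)\, n^k ={}& \sum_{n=1}^{\infty} |\langle n|\alpha,\zeta\rangle|^2 \beta^2 n^k + \sum_{n=1}^{\infty} |\langle n-1|\alpha,\zeta\rangle|^2 \nu^2 n^{k+1} \\
& - \sum_{n=1}^{\infty} 2\langle\alpha,\zeta|n\rangle\langle n-1|\alpha,\zeta\rangle\,\beta\nu\, n^{k+1/2}, \\
2\beta\nu \sum_{n=1}^{\infty} n^{k+1/2} \langle\alpha,\zeta|n\rangle\langle n-1|\alpha,\zeta\rangle ={}& \beta^2\sum_{n=1}^{\infty} n^k|\langle n|\alpha,\zeta\rangle|^2 + \nu^2\sum_{n=1}^{\infty} n^{k+1}|\langle n-1|\alpha,\zeta\rangle|^2 \\
& - \mu^2 \sum_{n=1}^{\infty} (n+1)\, n^k\, |\langle n+1|\alpha,\zeta\rangle|^2, \\
2\beta\nu\sum_{n=0}^{\infty} (n+1)^{k+1/2}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle ={}& \beta^2 \sum_{n=1}^{\infty} n^k |\langle n|\alpha,\zeta\rangle|^2 + \nu^2\sum_{n=0}^{\infty} (n+1)^{k+1}|\langle n|\alpha,\zeta\rangle|^2 \\
& - \mu^2\sum_{n=2}^{\infty} n(n-1)^k\, |\langle n|\alpha,\zeta\rangle|^2.
\end{aligned}
\tag{A.33}
\]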



If $k > 0$, then the sums in the first and third terms on the right hand side can be extended to $n = 0$,
giving

\[ 2\beta\nu\sum_{n=0}^{\infty} (n+1)^{k+1/2}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle = \beta^2\langle n^k\rangle + \nu^2\langle (n+1)^{k+1}\rangle - \mu^2\langle n(n-1)^k\rangle. \tag{A.34} \]

In the following we wish to take $k = -\tfrac12 - p$, so $k < 0$. In this case the sums cannot be extended to
zero, and in fact the additional terms would be infinite. In the following expansions, however, only
the behaviour of the state near $n = \bar n$ is considered, and the contribution from $n \approx 0$ is negligible.
It is therefore reasonable to use Eq. (A.34) as a basis for the approximate expansions, despite the
fact that some of the terms would be infinite if they were worked out exactly.
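For $k > 0$, where Eq. (A.34) is exact, the relation can be spot-checked numerically. The sketch below generates the number state coefficients from the three-term recursion (A.31) and compares both sides. It assumes a real amplitude $\alpha$ and real squeezing parameter $\zeta = r$ with $\mu = \cosh r$, $\nu = \sinh r$ and $\beta = (\mu + \nu)\alpha$; this parametrisation and the names `squeezed_coefficients` and `check_a34` are assumptions made for illustration, not taken from the text.

```python
import math

def squeezed_coefficients(alpha, r, dim=200):
    """Coefficients <n|alpha,zeta> from the three-term recursion, then normalised.

    Assumed convention: real alpha, zeta = r, mu = cosh r, nu = sinh r,
    beta = (mu + nu) * alpha.
    """
    mu, nu = math.cosh(r), math.sinh(r)
    beta = (mu + nu) * alpha
    c = [1.0, beta / mu]  # the n = 0 step has no <n-1| term
    for n in range(1, dim - 1):
        c.append((beta * c[n] - nu * math.sqrt(n) * c[n - 1])
                 / (mu * math.sqrt(n + 1)))
    norm = math.sqrt(sum(x * x for x in c))
    return mu, nu, beta, [x / norm for x in c]

def check_a34(alpha, r, k):
    """Return (lhs, rhs) of the summed relation for a given k > 0."""
    mu, nu, beta, c = squeezed_coefficients(alpha, r)
    prob = [x * x for x in c]
    lhs = 2 * beta * nu * sum((n + 1) ** (k + 0.5) * c[n] * c[n + 1]
                              for n in range(len(c) - 1))
    rhs = (beta ** 2 * sum(n ** k * p for n, p in enumerate(prob))
           + nu ** 2 * sum((n + 1) ** (k + 1) * p for n, p in enumerate(prob))
           # the n = 0 term vanishes, and skipping it avoids (-1)**k for non-integer k
           - mu ** 2 * sum(n * (n - 1) ** k * p
                           for n, p in enumerate(prob) if n >= 1))
    return lhs, rhs

print(check_a34(2.0, -0.3, 1))
```

The truncation at `dim=200` is only safe for modest amplitude and squeezing, where the coefficients have decayed far below rounding error well before the cutoff.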

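Taking $k = -\tfrac12 - p$ and considering the deviation from the mean photon number gives

\[
\begin{aligned}
2\beta\nu\sum_{n=0}^{\infty} (n+1)^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle \approx{}& \beta^2\langle(\bar n+\Delta n)^{-(p+1/2)}\rangle + \nu^2\langle(\bar n+1+\Delta n)^{-(p-1/2)}\rangle \\
& - \mu^2\langle(\bar n-1+\Delta n)^{-(p-1/2)}\rangle - \mu^2\langle(\bar n-1+\Delta n)^{-(p+1/2)}\rangle.
\end{aligned}
\tag{A.35}
\]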
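Expanding this in a series in $\Delta n$ gives

\[
\begin{aligned}
2\beta\nu\sum_{n=0}^{\infty} (n+1)^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle \approx \sum_{j=0}^{\infty} \frac{(-1)^j\langle\Delta n^j\rangle}{j!}\bigg\{ & \frac{(p+j-1/2)!}{(p-1/2)!}\left[\beta^2\bar n^{-(p+j+1/2)} - \mu^2(\bar n-1)^{-(p+j+1/2)}\right] \\
& + \frac{(p+j-3/2)!}{(p-3/2)!}\left[\nu^2(\bar n+1)^{-(p+j-1/2)} - \mu^2(\bar n-1)^{-(p+j-1/2)}\right]\bigg\}.
\end{aligned}
\tag{A.36}
\]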

Now we have an expression that can be used to evaluate Eq. (2.145). Here the approximation
that will be used for $h(n)$ is $c(n+1)^{-p}$ rather than $cn^{-p}$. This is reasonable, as the difference is of
order $n^{-(p+1)}$, which is of higher order than will be considered here. Using the first three terms of
the sum in Eq. (A.36) gives

\[
\begin{aligned}
2\beta\nu\sum_{n=0}^{\infty} (n+1)^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle \approx{}& \beta^2\bar n^{-(p+1/2)} - \mu^2(\bar n-1)^{-(p+1/2)} + \nu^2(\bar n+1)^{-(p-1/2)} \\
& - \mu^2(\bar n-1)^{-(p-1/2)} + \frac{\langle\Delta n^2\rangle}{2!}\Big\{(p+\tfrac32)(p+\tfrac12)\left[\beta^2\bar n^{-(p+5/2)} - \mu^2(\bar n-1)^{-(p+5/2)}\right] \\
& + (p+\tfrac12)(p-\tfrac12)\left[\nu^2(\bar n+1)^{-(p+3/2)} - \mu^2(\bar n-1)^{-(p+3/2)}\right]\Big\}.
\end{aligned}
\tag{A.37}
\]

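Note that the $j = 1$ term of the sum is zero, as $\langle\Delta n\rangle = 0$. Using $\langle\Delta n^2\rangle = \alpha^2(\mu-\nu)^2 + 2\mu^2\nu^2$ this
equation becomes

\[
\begin{aligned}
2\beta\nu\sum_{n=0}^{\infty} (n+1)^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle \approx{}& \beta^2\bar n^{-(p+1/2)} - \mu^2(\bar n-1)^{-(p+1/2)} + \nu^2(\bar n+1)^{-(p-1/2)} \\
& - \mu^2(\bar n-1)^{-(p-1/2)} + \left[\frac{\alpha^2(\mu-\nu)^2}{2} + \mu^2\nu^2\right]\Big\{(p^2+2p+\tfrac34)\left[\beta^2\bar n^{-(p+5/2)} - \mu^2(\bar n-1)^{-(p+5/2)}\right] \\
& + (p^2-\tfrac14)\left[\nu^2(\bar n+1)^{-(p+3/2)} - \mu^2(\bar n-1)^{-(p+3/2)}\right]\Big\}.
\end{aligned}
\tag{A.38}
\]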



At this stage the main problem is to determine which terms should be kept. This depends on
how $n_0$ scales with $\bar n$. Recall that if the state is optimised for minimum intrinsic phase uncertainty,
then $n_0 \propto \log(\bar n)$. That scaling cannot be assumed in this case; however, it is possible to make some
general assumptions about the scaling of $n_0$. Firstly $n_0$ should increase with $\bar n$, so in order to obtain
two orders in the approximation we will keep terms up to leading order divided by $n_0$.

Secondly $n_0$ should not increase as rapidly as $\bar n$, as the squeezing should increase as the photon
number is increased. This means that $n_0/\bar n^2$ should be higher order than $1/n_0$, and we can therefore
omit terms of leading order times $n_0/\bar n^2$. We will, however, include terms of leading order times
$n_0/\bar n$, as we cannot assume that these are higher order. We will not obtain any terms of leading
order times $(n_0/\bar n)^2$ or any higher power, so we do not need to consider these terms.

The third and fourth terms in (A.38) appear to be of order $\bar n^{-(p-3/2)}/n_0$. We can partially cancel
these terms, however, by making the expansion

\[ (\bar n\pm1)^{-(p-1/2)} = \bar n^{-(p-1/2)}\left[1 \mp \frac{(p-\frac12)}{\bar n} + \frac{(p+\frac12)(p-\frac12)}{2\bar n^2} + O(\bar n^{-3})\right]. \tag{A.39} \]

This means that

\[ \nu^2(\bar n+1)^{-(p-1/2)} - \mu^2(\bar n-1)^{-(p-1/2)} \approx (\nu^2-\mu^2)\,\bar n^{-(p-1/2)}\left[1 + \frac{(p+\frac12)(p-\frac12)}{2\bar n^2}\right] - (\nu^2+\mu^2)(p-\tfrac12)\,\bar n^{-(p+1/2)}. \tag{A.40} \]

Now $\nu^2 - \mu^2 = -1$, so together these terms are of order $\bar n^{-(p-1/2)}$. This is the leading order, and
we will now omit terms known to be higher order than $\bar n^{-(p-1/2)}/n_0$. This means that we will use

\[ \nu^2(\bar n+1)^{-(p-1/2)} - \mu^2(\bar n-1)^{-(p-1/2)} \approx -\bar n^{-(p-1/2)} - (\nu^2+\mu^2)(p-\tfrac12)\,\bar n^{-(p+1/2)}. \tag{A.41} \]
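The quality of this truncation is easy to probe numerically. The sketch below compares the exact difference $\nu^2(\bar n+1)^{-q} - \mu^2(\bar n-1)^{-q}$ with the two-term approximation, writing $q = p - \tfrac12$ and imposing $\mu^2 = \nu^2 + 1$. The function names and the sample values are illustrative assumptions, not taken from the text.

```python
def exact_difference(nbar, nu_sq, q):
    """nu^2 (nbar+1)^(-q) - mu^2 (nbar-1)^(-q), with mu^2 = nu^2 + 1."""
    mu_sq = nu_sq + 1.0
    return nu_sq * (nbar + 1.0) ** (-q) - mu_sq * (nbar - 1.0) ** (-q)

def truncated_difference(nbar, nu_sq, q):
    """Two-term approximation: -nbar^(-q) - (nu^2 + mu^2) q nbar^(-(q+1))."""
    mu_sq = nu_sq + 1.0
    return -nbar ** (-q) - (nu_sq + mu_sq) * q * nbar ** (-(q + 1.0))

nbar, nu_sq, p = 1000.0, 2.5, 1.0  # hypothetical sample values
exact = exact_difference(nbar, nu_sq, p - 0.5)
approx = truncated_difference(nbar, nu_sq, p - 0.5)
print(abs(exact - approx) / abs(exact))
```

For these sample values the relative discrepancy is of order $\bar n^{-2}$, consistent with the size of the omitted second-order term.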



Similarly for the terms on the third line of (A.38) there is the expansion

\[ (\bar n\pm1)^{-(p+3/2)} = \bar n^{-(p+3/2)}\left[1 \mp \frac{(p+\frac32)}{\bar n} + \frac{(p+\frac32)(p+\frac52)}{2\bar n^2} + O(\bar n^{-3})\right], \tag{A.42} \]

so

\[ \nu^2(\bar n+1)^{-(p+3/2)} - \mu^2(\bar n-1)^{-(p+3/2)} \approx -\bar n^{-(p+3/2)} - (\nu^2+\mu^2)(p+\tfrac32)\,\bar n^{-(p+5/2)}. \tag{A.43} \]



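Using this result, and omitting all terms known to be of higher order than $\bar n^{-(p-1/2)}/n_0$, Eq. (A.38)
simplifies to

\[
\begin{aligned}
2\beta\nu\sum_{n=0}^{\infty} (n+1)^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle \approx{}& \beta^2\bar n^{-(p+1/2)} - \mu^2(\bar n-1)^{-(p+1/2)} - \bar n^{-(p-1/2)} \\
& - (\mu^2+\nu^2)(p-\tfrac12)\,\bar n^{-(p+1/2)} - \frac{\alpha^2(\mu-\nu)^2}{2}\,(p^2-\tfrac14)\,\bar n^{-(p+3/2)}.
\end{aligned}
\tag{A.44}
\]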



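Now using $\beta \approx \sqrt{n_0}$, $\mu^2 \approx \nu^2 \approx \bar n/(4n_0)$ and $\alpha(\mu-\nu) \approx \bar n/\sqrt{n_0}$, this simplifies to

\[ 2\beta\nu\sum_{n=0}^{\infty} (n+1)^{-p}\langle\alpha,\zeta|n\rangle\langle n+1|\alpha,\zeta\rangle \approx -\bar n^{-(p-1/2)}\left\{1 - \frac{n_0}{\bar n} + \frac{1}{2n_0}\left[p(p+1) - \frac14\right]\right\}. \tag{A.45} \]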
Now we can expand $2\beta\nu$ to give



n V n 



4no V n / 



(A.46) 



Using this gives 



00 



_,,{i-^ + ^[p(p+i)-i]} 



n=0 

« j 1 

The two terms of $n_0/\bar n$ just cancel, giving the simple result



^(n + l)-f(a,C|n)(n+l|«,C) 



This result is identical to that obtained simply using $\langle n^{-p}\rangle$.



1- -L- 

^0 1) 

n 2no 



p(p + l) 



2n 



(A.47) 



(A, 



A.3 Perturbation Theory for Mark II Measurements

From Sec. 4.2.2, the phase variance for mark II measurements with a time delay is, according to
perturbation theory,

(A.49) 



(4>= fdt f fdt fdt'Ul 



Using the result for ipf^ given in Eq. (4.8), the first term can be evaluated as 



1 ,1 
dt / dt' 



e ia(y/tn-Vi)0(O) + 



-dW(u) 



t' 4a(^I-y/¥) 



>u 



-dW{u) 



1 



3 4a(V«r-V*)^ 



3 4a(V«r-V*)^ 



(t,t') e 4a(2^u-Vt-VF) 



dt I dt' I du- 

/•I ft ft' iaii^/u-Vt-Vv) 

2 dt dt' du 



(A.50) 



Considering the first term, this simplifies to 



V^T , 1 



2 



2a 8a 2 



(*S?) 
1 



4a 2 8a 3 64a 4 



(A.51) 



The term that we take to be leading order here depends on how we take the limits. If we consider
the limit of large $\alpha$ with a fixed value of $v_1$, then the first term is largest. On the other hand, if we
consider the limit of small $v_1$ the third term is the largest. As the variance should be small at time
$v_1$, a large number of detections should have been made by this time. This means that we should
have $\alpha^2 v_1 \gg 1$. In this limit, the first term above is largest.

Now consider the second term in Eq. (A.50). Changing variables to $s = \sqrt{t'} - \sqrt{u}$ gives



dt / dt' 



D — Sas 



dt 



Performing an expansion in s, 



[-2(VF- s )ds]e ia ^-^— E - 
v~ x - {Vt'-s) 2 

VF—^/vT 



dt , e 4 a (V¥-Vi) 



-8as 



ds 



(A.52) 



4 / dt / dt'e ia ^-^ 



dt 



dt , e 4a(V¥-Vi) 



' +0(a" 3 ) 



^Sa t'{8a) 2 

Next we substitute $s = \sqrt{t} - \sqrt{t'}$ so that $dt' = -2(\sqrt{t} - s)\,ds$. This gives



dt 



Vi—i/vi 



-4qs 



8a {Vt-s){8a)- 



+ 0(a~ 3 ) ds 



(A.53) 



(A.54) 



Again expanding in a series we obtain 



8 / dt 



Vt- 



1 



-4qs 



+ 



8a Vt(8a) 2 t(8a) : 



+ 0(s'a' i ) + 0{a~ i ) ds 



l Vl V(8a)(4a) 



0(a 



-3- 



dt 



4a 2 



0(a 



(A.55) 



In the limit of small $v_1$ this simplifies to $1/(4\alpha^2)$. Note that the higher order terms diverge if the
limit $v_1 \to 0$ is taken for fixed $\alpha$. However, as we are taking $v_1$ such that $\alpha^2 v_1 \gg 1$, they will be
smaller than the term given here.



Thus we find that the first term of Eq. (A.49) is



(A.56) 



Next consider the second term in Eq. (A.49). Evaluating this gives



dt I dt' 



i' 



4 a 



ds 



-dW(v)- 



4a 

= 4Q( v /Ti-v / t 7 ) 



e 4 Q (2^r-v^-VF)^(0)^l) _ I ^ e 4a(2yW-Vt-V^) ( ^(0) )2ds 
t fS e 4a(V^-Vt) 



/It 



-dW(u) 



t ft' 4a(V«-Vt) o A , r- / — 7\ 

■= dW(v)-e ia{ ^-^dW(u) 



+ dt dt' 



4a 



ds 



min(t,s) e 4a(2V"-v / t-v / t 7 ) 



du + 2 



min(t,t') e 4a(2 v / «— Vt-W) 



du 



(A.57) 
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The first term simplifies to 



vi y/vi 1 



4a 2 8a 3 64a 4 



(A.58) 



As was discussed above, the first term here will be largest when we take $v_1$ such that $\alpha^2 v_1 \gg 1$.
Now consider the second term in Eq. (A.57). Taking the integral over $t$ this simplifies to

( 2 ^+ [ 1{l0SVl ~ logOe 4 "^^ ((< } ) 2 ) dt'. (A.59) 
Now we make the substitution $s = \sqrt{t'} - \sqrt{v_1}$ so $dt' = 2(s + \sqrt{v_1})\,ds$, giving



2a 
2 



log 



e- ias ((^) 2 )[2( S + V %)d S ] 
1 log (l + JL) e- 4 « s ((^) 2 ) [(* + >T)d S ]. (A.60) 



Again, there is a slight problem, as the result obtained depends on how the limits are taken. The
contributions to this integral will only be significant when $s$ is less than about $1/\alpha$. As we should
have $\alpha^2 v_1 \gg 1$, this means we should have $s/\sqrt{v_1} \ll 1$. In this limit, we obtain













a) 












a) 




1 


2a 2 


4a 3 



^s + ^— + 0(s 3 )je-^((^f)ds 

^ + v^F + 0(O )( ( ^ ))2 ) 



(A.61) 



Next we will consider the third term. To treat the minimum we must split the integral into three 
parts: 



1 f 1 /"*' 4a /-min(M) e 4a(2V^-v / t-v / t 7 ) 



(it / eff 



-dv 



du 



f Act [ v e^VTt-Vt-V^) 



dt I dt I — dv 

v 



dt [ dt 1 

vi Jt 



du 



*' 4a /"* ^"(Sv^-Vt-Vt 7 ) 
— dv 



du 



1 f 1 t* 4a e^V^-Vt-Vt 7 ) 



/ dt 



= -2 / dt / dt' / — dv 



4q 



„4a (2 v°tt- s/i- Vt 7 ) 



-du (a) 



1 f 1 /"*' 4a /"* e 4a(2 v / ^-Vt-v / t 7 ) 
-** ' J+/ ' -dw ' 



dt / dt' 

VI Jt 



-du. (b) 



Considering term (a) , this simplifies to 



-2 / (// / .//' / — dv 

v 



Vv—^/vJ p-So 



J,a(2^-Vi-V¥) ^ g 



IV — s 



2 dt dt 



v± J vi J vi 



-dv 



\fv V 



(A.62) 
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= -2 / dt I dt' I —dv 



1 ft i-Vt'-^/vT 



( 1 + f + 0(a- 3 )] e M^-Vt-VF) 



-4 I dt I dt' I " ' i — =1 + L + 0(a- 3 ) ) e 4«(^-yt) e -8 QS 

4 £ * £ * jf"" (?+^ + <^ + °<- 3 > ) ^-^- 8 "** 

J_ r 1 ,. /** ,./ / 1 3 

"2a 

1 



Vl 

=-i£ di (^ + 8s +o < a - 2 ))' (A - 63) 

Term (b) simplifies to 

fl fl ft e 4a(2 v ^7- v / t-Vt 7 ) 

-4a I dt I dt'[logt'-logt] / du 

fl fl pVi—JvT „-8as 

= -8a/ dt I dt'flogt'-logtl / — e 4a ^-^rfs 



./n \/i—3 



8a y dt y dfpogt'-iogt] y ^ + f j e -- e M^) ds 

di'[logt' - logt] f4= + — + °( s2 )) e ia{ ^-^ 

J vi Jt \Vt 8at J 



dt I 4= + A + ) / ( 3 + -T + °( s *) ) e~ 4QS ds 



Vt 8at J Jo V 2-\/t 



Therefore the third term of Eq. (A.57) simplifies to



2a 2 y fll \y/t 2at 
Next considering the fourth term of Eq. (A.57), we obtain



' ; / ' ' 0(a- 2 ) I dt. (A.65) 



fl ft ft' e 4Q(2 v ^T- v / i-v / F) (-1 fl ft e 4a(2 v ^T-v / t-v / t 7 ) 

2 / dt / dt' / j-g du + 2 / dt / dt' / — du 

J Vi J Vi J Vi J Vi 

- 4 L*L *L — ^ — d " 



dt h -—^7 / =-= « u 



,, v 2a 8«Vi Bl w 1 - 5 
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dt[ y/i- 



1 \ ["^t^V^ e~ Sas 



la J Jo (V~t~s) 2 



ds 



/ dt ( 4= + ^7 + 0(a~ 2 
2a 2 J V1 \Vi 2at ' 



The third and fourth terms therefore cancel to order $\alpha^{-3}$. This means that



(A.66) 



(A.67) 



Therefore we find that Eq. (A.49) simplifies to

1 Wl 



~ 5? + & (<^°'> 2 > + 5 (T <«S?*e , > - VW<(«) 2 )) . (A.68) 

A.4 Derivations for Continuous Measurements

Evaluating Eq. (7.57) of Sec. 7.1.2 for the phase variance under continuous measurements, we find



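\[
\begin{aligned}
\langle\theta^2(t)\rangle ={}& 2\chi^3\left\langle \int_{-\infty}^{t}\! dv_1 \int_{-\infty}^{t}\! dv_2\, e^{\chi(v_1+v_2-2t)} \int_{-\infty}^{v_1}\! dW(u_1)\int_{-\infty}^{v_2}\! dW(u_2)\, e^{2|\alpha|\sqrt{2\chi}(u_1+u_2-v_1-v_2)} \right\rangle \\
& + 8\chi^3|\alpha|^2\kappa^2 \left\langle \int_{-\infty}^{t}\! dv_1 \int_{-\infty}^{t}\! dv_2\, e^{\chi(v_1+v_2-2t)} \int_{-\infty}^{v_1}\! du_1\int_{-\infty}^{v_2}\! du_2\, e^{2|\alpha|\sqrt{2\chi}(u_1+u_2-v_1-v_2)} \int_{u_1}^{t}\! dW'(s_1)\int_{u_2}^{t}\! dW'(s_2) \right\rangle \\
={}& 2\chi^3 \int_{-\infty}^{t}\! dv_1 \int_{-\infty}^{t}\! dv_2\, e^{\chi(v_1+v_2-2t)} \int_{-\infty}^{\min(v_1,v_2)}\! du\, e^{2|\alpha|\sqrt{2\chi}(2u-v_1-v_2)} \\
& + 8\chi^3|\alpha|^2\kappa^2 \int_{-\infty}^{t}\! dv_1 \int_{-\infty}^{t}\! dv_2\, e^{\chi(v_1+v_2-2t)} \int_{-\infty}^{v_1}\! du_1\int_{-\infty}^{v_2}\! du_2\, e^{2|\alpha|\sqrt{2\chi}(u_1+u_2-v_1-v_2)} \int_{\max(u_1,u_2)}^{t}\! ds.
\end{aligned}
\tag{A.69}
\]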



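Considering the first term first, we have

\[
\begin{aligned}
& 2\chi^3\int_{-\infty}^{t}\! dv_1\int_{-\infty}^{t}\! dv_2\, e^{\chi(v_1+v_2-2t)}\int_{-\infty}^{\min(v_1,v_2)}\! du\, e^{2|\alpha|\sqrt{2\chi}(2u-v_1-v_2)} \\
&\quad = 2\chi^3\int_{-\infty}^{t}\! dv_1\int_{-\infty}^{v_1}\! dv_2\, e^{\chi(v_1+v_2-2t)}\int_{-\infty}^{v_2}\! du\, e^{2|\alpha|\sqrt{2\chi}(2u-v_1-v_2)} \\
&\qquad + 2\chi^3\int_{-\infty}^{t}\! dv_1\int_{v_1}^{t}\! dv_2\, e^{\chi(v_1+v_2-2t)}\int_{-\infty}^{v_1}\! du\, e^{2|\alpha|\sqrt{2\chi}(2u-v_1-v_2)} \\
&\quad = 4\chi^3\int_{-\infty}^{t}\! dv_1\int_{-\infty}^{v_1}\! dv_2\, e^{\chi(v_1+v_2-2t)}\int_{-\infty}^{v_2}\! du\, e^{2|\alpha|\sqrt{2\chi}(2u-v_1-v_2)} \\
&\quad = \frac{\chi^3}{|\alpha|\sqrt{2\chi}}\int_{-\infty}^{t}\! dv_1\int_{-\infty}^{v_1}\! dv_2\, e^{\chi(v_1+v_2-2t)}\, e^{2|\alpha|\sqrt{2\chi}(v_2-v_1)} \\
&\quad = \frac{\chi^3}{|\alpha|\sqrt{2\chi}}\int_{-\infty}^{t}\! dv_1\, \frac{e^{2\chi(v_1-t)}}{\chi+2|\alpha|\sqrt{2\chi}} \\
&\quad = \frac{\chi^3}{|\alpha|(2\chi)^{3/2}\left(\chi+2|\alpha|\sqrt{2\chi}\right)} \\
&\quad \approx \frac{\chi}{8|\alpha|^2}.
\end{aligned}
\tag{A.70}
\]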



For the approximation in the last line, it has been assumed that $|\alpha| \gg \sqrt{\chi}$.
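How quickly this limit is reached can be gauged numerically. The sketch below compares the closed-form expression $\chi^3/[|\alpha|(2\chi)^{3/2}(\chi + 2|\alpha|\sqrt{2\chi})]$ with its large-$|\alpha|$ limit $\chi/(8|\alpha|^2)$. The function names and the sample values of $|\alpha|$ and $\chi$ are illustrative assumptions.

```python
import math

def first_term_exact(alpha_abs, chi):
    """chi^3 / (|alpha| (2 chi)^(3/2) (chi + 2 |alpha| sqrt(2 chi)))."""
    return chi ** 3 / (alpha_abs * (2 * chi) ** 1.5
                       * (chi + 2 * alpha_abs * math.sqrt(2 * chi)))

def first_term_limit(alpha_abs, chi):
    """Limiting form chi / (8 |alpha|^2), valid for |alpha| >> sqrt(chi)."""
    return chi / (8 * alpha_abs ** 2)

for alpha_abs in (1.0, 10.0, 100.0):  # hypothetical sample amplitudes
    ratio = first_term_exact(alpha_abs, 1.0) / first_term_limit(alpha_abs, 1.0)
    print(alpha_abs, ratio)
```

The ratio of the two expressions is $1/[1 + \sqrt{\chi}/(2\sqrt{2}|\alpha|)]$, so the relative correction falls off only as $\sqrt{\chi}/|\alpha|$.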



Now considering the second term in (A.69), we obtain 



t t V\ V2 t 

s^MV / *, / / „„, / ^H^c— > / ds 

— oo — oc — oo — oo max(«i,« 2 ) 

t t Vi min(v 2 ,u 1 ) t 

= 8 X 3 |a| 2 K 2 / dvi [ dv 2 e x< - Vl+V2 ~ 2t ^ [ dm [ d M2 e 2 MV^(«i+«2-f i-« 2 ) f ds 



— OO —oc — OO — OO til 

t t min(vi,u 2 ) v 2 t 

*^/*/^ / a*/*^*-— ■>/* 

— OO — OO — CO til U2 

t Vl U\ V2 

- S*= W / *, / *, / / ^M^.".> (i - „,) (a) 

— oo — oo — oo — oo 

+8X " M V / *, / / fe e*l^<.— -)„ - ( b ) 

— OO — OO U\ — oo 

+8x 3 |a| 2 K 2 / dwi / dv 2 e x( - Vl+V2 ~ 2t) f du x [ du 2 e 2lal ^ 2x(ui+U2 ~ Vl - V2) (t - u 2 ) (c) 



- oo — oo — oo iti 

t t Vl 1)2 



+8 X 3 \a\ 2 n 2 / dvi dv 2 e x{vi+V2 ~ 2t) / dm du 2 e 2 ^^ ul+U2 ^ vl ~ V2 ^(t-u 2 ) (d) 



(A.71) 

Due to the difficult nature of the integration limits, we have obtained four terms. Evaluating each 
of these terms in turn, we firstly find for (a) 

t Vl 111 V2 



8 X 3 M 2 K 2 



8x 3 HV / d Vl J du x j dv 2 e x[vi+V2 ~ 2t) J du 2 e 2lalVlxiui+U2 - Vl - V2) '(t - ui) 

— OO —OO —oo 

Vl U\ 

— oo — oo 

dvi ( duie x( " 1+tll - 2t) e 2|a| ^ (M1 -" l) (i-ui) 



4.x 2 \a\ n 2 



2X 



-oo — oo 
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= 4x2 / d Vl e^-» (t + - v, 
The approximation in the last line is accurate in the limit $|\alpha| \gg \sqrt{\chi}$. Turning to the next term, (b),
we obtain
Again this is accurate in the limit $|\alpha| \gg \sqrt{\chi}$. This term is higher order than the result obtained for
(a), and so can be ignored.

Now looking at the third term (c), 
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16 |a| V2x 

This is again higher order than term (a), and so can be ignored. Now looking at the final term, (d), 
we find 
For $|\alpha| \gg \sqrt{\chi}$, the only significant term here is
Using this, together with the results from Eqs (A.70) and (A.72), the total phase variance is



<e 2 W) 



8 |«| 
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2* 



(A.77) 
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